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NOVEL G-CSF RECEPTOR AGONISTS 

The present application claims priority under 35 USC 
§119 (e) of United States provisional application Serial No. 
5 60/004,382 filed October 05, 1995. 

Field ot the Invention 
The present invention relates to human G-CSF receptor 
agonists with activity on hematopoietic cell differentiation 
10 and expansion. 

Background of the Invention 

The human blood-forming (hematopoietic) system 

15 replaces a variety of white blood cells (including 

neutrophils, macrophages, and basophils/mast cells), red 
blood cells (erythrocytes) and clot-forming cells 
(megakaryocytes/platelets) . The hematopoietic systems of 
the average male has been estimated to produce on the order 

20 of 4.5 x 10 11 granulocytes and erythrocytes every year, 

which is equivalent to an annual replacement of total body 
weight (Defter et al . , BioEssays, 2;154-158, 1985). 

It is believed that small amounts of certain 
hematopoietic growth factors account for the differentiation 

25 of a small number of progenitor "stem cells" into the 
variety of blood cell lines, for the tremendous 
proliferation of those lines, and for the ultimate 
differentiation of mature blood cells from those lines. 
Because the hematopoietic growth factors are present in 

30 extremely small amounts, the detection and identification of 
these factors has relied upon an array of assays which as 
yet only distinguish among the different factors on the 
basis of stimulative effects on cultured cells under 
artificial conditions. 
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U.S. Patent 4,999,291 discloses DNA and methods for 
making G-CSF the disclosure of which is incorporated herein 
by reference in it entirety. 

5 U.S. Patent 4,810,643 relates to DNA and methods of 

making G-CSF and Cys to Ser substitution variants of G-CSF. 

Kuga et al . {Biochem. + Biophys . Res. Comm. 159:103- 
111, 1988) made a series of G-CSF variants to partially 
10 define the structure-function relationship. Kuga et al. 
found that internal and C-terminal deletions abolished 
activity, while N-terminal deletions of up to 11 amino acids 
and amino acid substitutions at positions 1, 2 and 3 were 
active. 

15 

Watanabe et al . (Anal. Biochem. 195:38-44, 1991) made a 
variant to study G-CSF receptor binding in which amino acids 
1 and 3 were changed to Tyr for radioiodination of the 
protein. Watanabe et al. found this Tyr 1 , Tyr 3 G-CSF variant 
20 to be active. 

WO 95/27732 describes, but does not show that the 
molecule has biological activity, a circularly permuted G- 
CSF ligand with a breakpoint at positions 68/69 creating a 
25 circularly permuted G-CSF ligand with a new N-terminus at 
the original position 69 of G-CSF and a new C-terminus at 
the original position 68 of G-CSF. WO 95/27732 also 
discloses circularly permuted GM-CSF, IL-2 and IL-4. 

30 Rearrangement of Protein Sequences 

In evolution, rearrangements of DNA sequences serve an 
important role in generating a diversity of protein 
structure and function. Gene duplication and exon shuffling 
35 provide an important mechanism to rapidly generate diversity 
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and thereby provide organisms with a competitive advantage, 
especially since the basal mutation rate is low (Doolittle, 
Protein Science 1:191-200, 1992). 

The development of recombinant DNA methods has made it 
5 possible to study the effects of sequence transposition on 
protein folding, structure and function. The approach used 
in creating new sequences resembles that of naturally 
occurring pairs of proteins that are related by linear 
reorganization of their amino acid sequences (Cunningham, et 

10 al., Proc. Natl. Acad. Sci. U.S.A. 76:3218-3222, 1979; 

Teather & Erfle, J. Bacteriol. 172: 3837-3841, 1990; 
Schimming et al . , Eur. J. Biochem. 204: 13-19, 1992; 
Yamiuchi and Minamikawa, FEBS Lett. 260:127-130, 1991: 
MacGregor et al., FEBS Lett . 378:263-266, 1996). The first 

15 in vitro application of this type of rearrangement to 

proteins was described by Goldenberg and Creighton (J. Mol. 
Biol. 165:407-413, 1983). A new N-terminus is selected at 
an internal site (breakpoint) of the original sequence, the 
new sequence having the same order of amino acids as the 

20 original from the breakpoint until it reaches an amino acid 
that is at or near the original C-terminus. At this point 
the new sequence is joined, either directly or through an 
additional portion of sequence (linker), to an amino acid 
that is at or near the original N-terminus, and the new 

25 sequence continues with the same sequence as the original 
until it reaches a point that is at or near the amino acid 
that was N-terminal to the breakpoint site of the original 
sequence, this residue forming the new C-terminus of the 
chain. 

30 This approach has been applied to proteins which range 

in size from 58 to 462 amino acids (Goldenberg & Creighton, 
J. Mol. Biol. 165:407-413, 1983; Li & Coffino, Mol . Cell. 
Biol. 13:2377-2383, 1993). The proteins examined have 
represented a broad range of structural classes, including 

35 proteins that contain predominantly a-helix ( inter leukin-4; 
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Kreitman et al . , Cytokine 7 : 311-318 , 1995), (*sheet 
(interleukin-1 ; Horlick et al . , Protein Eng. 5:427-431, 
1992), or mixtures of the two (yeast phosphoribosyl 
anthranilate isomerase; Luger et al., Science 243:206-210, 
5 1989) . Broad categories of protein function are represented 
in these sequence reorganization studies: 



10 



T4 lysozyme 



Zhang et al . , Biochemistry 
32:12311-12318 (1993); Zhang et 
al., Nature Struct. Biol. 1:434-438 
(1995) 



15 dihydrofolate 
reductase 



Buchwalder et al., Biochemistry 
31:1621-1630 (1994); Protasovaet 
al., Prot. Eng. 7:1373-1377 (1995) 



20 



ribonuclease Tl 



Mullins et al., J. Am. Chem. Soc. 
116:5529-5533 (1994); Garrett et al., 
Protein Science 5:204-211 (1996) 



25 



Bacillus |*-glucanse Hahn et al., Proc. Natl. Acad. Sci. 

U.S.A. 91:10417-10421 (1994) 

aspartate Yang & Schachman, Proc. Natl. Acad. 

transcarbamoylase Sci. U.S.A. 90:11980-11984 (1993) 



phosphoribosyl Luger et al., Science 243:206-210 

30 anthranilate (1989); Luger et al., Prot. Eng. 

isomerase 3:249-258 (1990) 

pepsin/pepsinogen Lin et al., Protein Science 4:159- 

166 (1995) 



35 
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glyceraldehyde-3- Vignais et al., Protein Science 

phosphate dehydro- 4:994-1000 (1995) 

genase 



ornithine 
decarboxylase 



Li St Coffino, Mol . Cell. Biol 
13:2377-2383 (1993) 



10 



yeast 

phosphoglycerate 
dehydrogenase 



Ritco-Vonsovici et al., Biochemistry 
34:16543-16551 (1995) 



Enzyme Inhibitor 



15 



basic pancreatic Goldenberg & Creighton, J. Mol 
trypsin inhibitor Biol. 165:407-413 (1983) 



20 



Cytokines 



interleuki.» -lp 



Horlick et al., Protein Eng. 5:427- 
431 (1992) 



interleukin-4 



Kreitman et al., Cytokine 7:311- 
318 (1995) 



25 Tyrosine Kinase 

Recognition Domain 



30 



ct-spectrin SH3 
domain 

Transmembrane 
Protein 



Viguera, et al . , J. 

Mol. Biol. 247:670-681 (1995) 



35 



omp A 



Koebnik & Kramer, J. Mol. Biol. 
250:617-626 (1995) 
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Chimeric Protein 

interleukin-4- Kreitman et al. f Proc. Natl. Acad. 

5 Pseudomonas Sci. U.S.A. 91:6889-6893 (1994). 

exotoxin fusion 
molecule 

The results of these studies have been highly variable. 

10 In many cases substantially lower activity, solubility or 

thermodynamic stability were observed (£. coli dihydrof olate 
reductase, aspartate transcarbamoylase, phosphoribosyl 
anthranilate isomerase, glyceraldehyde-3 -phosphate 
dehydrogenase, ornithine decarboxylase, omp A, yeast 

15 phosphoglycerate dehydrogenase) . In other cases, the 

sequence rearranged protein appeared to have many nearly 
identical properties as its natural counterpart (basic 
pancreatic trypsin inhibitor, T4 lysozyme, ribonuclease Tl, 
Bacillus P-glucanase, interleukin-lp, aspectrin SH3 domain, 

20 pepsinogen, interleukin-4 ) . In exceptional cases, an 

unexpected improvement over some properties of the natural 
sequence was observed, e.g., the solubility and refolding 
rate for rearranged a-spectrin SH3 domain sequences, and the 
receptor affinity and anti-tumor activity of transposed 

25 interleukin-4-Pseudomonas exotoxin fusion molecule (Kreitman 

et al., Proc. Natl. Acad. Sci. U.S.A. 91:6889-6893, 1994; 
Kreitman et al . , Cancer Res. 55:3357-3363, 1995). 

The primary motivation for these types of studies has 
been to study the role of short-range and long-range 

30 interactions in protein folding and stability. Sequence 

rearrangements of this type convert a subset of interactions 
that are long-range in the original sequence into short- 
range interactions in the new sequence, and vice versa. The 
fact that many of these sequence rearrangements are able to 

35 attain a conformation with at least some activity is 

persuasive evidence that protein folding occurs by multiple 
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folding pathways (Viguera, et al. # J. Mol . Biol. 247:670- 
681, 1995) . In the case of the SH3 domain of otspectrin, 
choosing new termini at locations that corresponded to ($ 
-hairpin turns resulted in proteins with slightly less 
5 stability, but which were nevertheless able to fold. 

The positions of the internal breakpoints used in the 
studies cited here are found exclusively on the surface of 
proteins, and are distributed throughout the linear sequence 
without any obvious bias towards the ends or the middle (the 

10 variation in the relative distance from the original N- 
terminus to the breakpoint is ca. 10 to 80% of the total 
sequence length) . The linkers connecting the original N- and 
C-termini in these studies have ranged from 0 to 9 residues. 
In one case (Yang & Schachman, Proc. Natl. Acad. Sci. U.S.A. 

15 90:11980-11984, 1993), a portion of sequence has been 
deleted from the original C-terminal segment, and the 
connection made from the truncated C- terminus to the 
original N -terminus. Flexible hydrophilic residues such as 
Gly and Ser are frequently used in the linkers. Viguera, et 

20 al.(J. Mol. Biol. 247:670-681, 1995) compared joining the 

original N- and C- termini with 3- or 4-residue linkers; the 
3 -residue linker was less thermodynamically stable. 
Protasova et al . {Protein Eng. 7:1373-1377, 1994) used 3- or 
5-residue linkers in connecting the original N-termini of E. 

25 coli dihydrofolate reductase; only the 3 -residue linker 
produced protein in good yield. 
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Summary of the T nvention 



5 The modified human G-CSF receptor agonists of the 

present invention can be represented by the Formula: 

xl-(L) a -X 2 

10 wherein; 

a is 0 or 1; 

X 1 is a peptide comprising an amino acid sequence 
corresponding to the sequence of residues n+1 through J; 

X 2 is a peptide comprising an amino acid sequence 
15 corresponding to the sequence of residues 1 through n; 

n is an integer ranging from 1 to J-l; and 
L is a linker. 

In the formula above the constituent amino acids 
20 residues of human G-CSF are numbered sequentially 1 through 
J from the amino to the carboxyl terminus. A pair of 
adjacent amino acids within this protein may be numbered n 
and n+1 respectively where n is an integer ranging from 1 to 
J-l. The residue n+1 becomes the new N-terminus of the new 
25 G-CSF receptor agonist and the residue n becomes the new C- 
terminus of the new G-CSF receptor agonist. 



30 



The present invention relates to novel G-CSF receptor 
agonists of the following formula: 

1 10 

Xaa Xaa Xaa Gly Pro Ala Ser Ser Leu Pro Gin Ser Xaa 

35 20 

Leu Leu Xaa Xaa Xaa Glu Gin Val Xaa Lys Xaa Gin Gly Xaa Gly 

30 40 
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Ala Xaa Leu Gin Glu Xaa Leu Xaa Ala Thr Tyr Lys Leu Xaa Xaa 

50 

Xaa Glu Xaa Xaa Val Xaa Xaa Gly His Ser Xaa Gly lie Pro Trp 

5 

60 70 
Ala Pro Leu Ser Ser Xaa Pro Ser Xaa Ala Leu Xaa Leu Ala Gly 

80 

10 Xaa Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu 

90 100 
Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu 

15 110 

Xaa Thr Leu Gin Xaa Asp Val Ala Asp Phe Ala Xaa Thr lie Trp 



20 



120 130 

Gin Gin Met Glu Xaa Xaa Gly Met Ala Pro Ala Leu Gin Pro Thr 

140 

Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Xaa Gin Xaa Xaa Ala 



150 160 
25 Gly Gly Val Leu Val Ala Ser Xaa Leu Gin Xaa Phe Leu Xaa Xaa 

170 

Ser Tyr Arg Val Leu Xaa Xaa Leu Ala Gin Pro (SEQ ID NO:l> 

30 wherein 

Xaa at position 1 is Thr, Ser, Arg, Tyr or Gly; 

Xaa at position 2 is Pro or Leu; 

Xaa at position 3 is Leu, Arg, Tyr or Ser; 
35 Xaa at position 13 is Phe, Ser, His, Thr or Pro; 

Xaa at position 16 is Lys, Pro, Ser, Thr or His; 

Xaa at position 17 is Cys, Ser, Gly, Ala, lie, Tyr or Arg; 

Xaa at position 18 is Leu, Thr, Pro, His, lie or Cys; 

Xaa at position 22 is Arg, Tyr, Ser, Thr or Ala; 
40 Xaa at position 24 is lie, Pro, Tyr or Leu; 

Xaa at position 27 is Asp, or Gly; 

Xaa at position 30 is Ala, lie, Leu or Gly; 

Xaa at position 34 is Lys or Ser; 

Xaa at position 36 is Cys or Ser; 
45 Xaa at position 42 is Cys or Ser; 

Xaa at position 43 is His, Thr, Gly, Val, Lys, Trp, Ala, 
Arg, Cys, or Leu; 

Xaa at position 44 is Pro, Gly, Arg, Asp, Val, Ala, His, 
Trp, Gin, or Thr; 
50 Xaa at position 46 is Glu, Arg, Phe, Arg, lie or Ala; 

Xaa at position 47 is Leu or Thr; 

Xaa at position 49 is Leu, Phe, Arg or Ser; 
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Xaa at position 50 is Leu, lie, His, Pro or Tyr; 

Xaa at position 54 is Leu or His; 

Xaa at position 64 is Cys or Ser; 

Xaa at position 67 is Gin, Lys, Leu or Cys; 

5 Xaa at position 70 is Gin, Pro, Leu, Arg or Ser; 

Xaa at position 74 is Cys or Ser; 

Xaa at position 104 is Asp, Gly or Val; 

Xaa at position 108 is Leu, Ala, Val, Arg, Trp, Gin or Gly; 

Xaa at position 115 is Thr, His, Leu or Ala; 

10 Xaa at position 120 is Gin, Gly, Arg, Lys or His 

Xaa at position 123 is Glu, Arg, Phe or Thr 

Xaa at position 144 is Phe, His, Arg, Pro, Leu, Gin or Glu; 

Xaa at position 146 is Arg or Gin; 

Xaa at position 147 is Arg or Gin; 

15 Xaa at position 156 is His, Gly or Ser; 

Xaa at position 159 is Ser, Arg, Thr, Tyr, Val or Gly; 

Xaa at position 162 is Glu, Leu, Gly or Trp; 

Xaa at position 163 is Val, Gly, Arg or Ala; 

Xaa at position 169 is Arg, Ser, Leu, Arg or Cys; 

20 Xaa at position 170 is His, Arg or Ser; 

wherein optionally 1-11 amino acids from the N-terminus and 
1-5 from the C- terminus can be deleted; and 

25 wherein the N-terminus is joined to the C-terminus directly 
or through a linker capable of joining the N-terminus to the 
C-terminus and having new C- and N- termini at amino acids; 



30 


38-39 


62-63 


123-124 




39-40 


63-64 


124-125 




40-41 


64-65 


125-126 




41-42 


65-66 


126-127 




42-43 


66-67 


128-129 


35 


43-44 


67-68 


128-129 




45-46 


68-69 


129-130 




48-49 


69-70 


130-131 




49-50 


70-71 


131-132 




52-53 


71-72 


132-133 


40 


53-54 


91-92 


133-134 




54-55 


92-93 


134-135 




55-56 


93-94 


135-136 




56-57 


94-95 


136-137 




57-58 


95-96 


137-138 


45 


58-59 


96-97 


138-139 




59-60 


97-98 


139-140 




60-61 


98-99 


140-141 




61-62 


99-100 


141-142 

or 142-143. 
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The G-CSF receptor agonists of the present invention 
may contain amino acid substitutions, deletions and/or 
insertions and may also have amino acid deletions at 
5 either/or both the N- and C- termini. 

The more preferred breakpoints at which new C-terminus 
and N-terminus can be made are; 38-39, 39-40, 40-41, 41-42, 
48-49, 53-54, 54-55, 55-56, 56-57, 57-58, 58-59, 59-60, 60- 
10 61, 61-62, 62-63, 64-65, 65-66, 66-67, 67-68, 68-69, 69-70, 

96-97, 125-126, 126-127, 127-128, 128-129, 129-130, 130-131, 
131-132, 132-133, 133-134, 134-135, 135-136, 136-137, 137- 
138, 138-139, 139-140, 140-141 and 141-142. 

15 The most preferred breakpoints at which new C-terminus 

and N-terminus can be made are; 38-39, 48-49, 96-97, 125- 
126, 132-133 and 141-142. 

A preferred embodiment of the present invention the 
20 linker (L) joining the N-terminus to the C-terminus is a 
polypeptide selected from the group consisting of: 

GlyGlyGlySer (SEQ ID NO: 2) ; 

GlyGlyGlySerGlyGlyGlySer (SEQ ID NO: 61); 

GlyGlyGlySerGlyGlyGlySerGlyGlyGlySer (SEQ ID NO: 62); 
25 SerGlyGlySerGlyGlySer (SEQ ID NO:63); 

GluPheGlyAsnMet (SEQ ID NO: 64); 

GluPheGlyGlyAsnMet (SEQ ID NO: 65); 

GluPheGlyGlyAsnGlyGlyAsnMet (SEQ ID NO: 66); and 

GlyGlySerAspMetAlaGly (SEQ ID NO: 67). 

30 

The present invention also encompasses recombinant 
human G-CSF receptor agonists co-administered or 
sequentially with one or more additional colony stimulating 
factors (CSF) including, cytokines, lymphokines, 
35 interleukins, hematopoietic growth factors which include but 
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are not limited to GM-CSF, c-mpl ligand (also known as TPO 
or MGDF) , M-CSF, erythropoietin (EPO) , IL-1, IL-4, IL-2, IL- 
3, IL-5, IL 6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, 
IL-15, LIF, flt3/flk2 ligand, human growth hormone, B-cell 
5 growth factor, B-cell differentiation factor, eosinophil 

differentiation factor and stem cell factor (SCF) also known 
as steel factor or c-kit ligand (herein collectively 
referred to as "colony stimulating factors" or W CSF"). These 
co-administered mixtures may be characterized by having the 

10 usual activity of both of the peptides or the mixture may be 
further characterized by having a biological or 
physiological activity greater than simply the additive 
function of the presence of the G-CSF receptor agonists or 
the second colony stimulating factor alone. The co- 

15 administration may also provide an enhanced effect on the 

activity or an activity different from that expected by the 
presence of the G-CSF ligand or the second colony 
stimulating factor. The co-administration may also have an 
improved activity profile which may include reduction of 

20 undesirable biological activities associated with native 
human G-CSF. In addition to the list above, IL-3 variants 
taught in WO 94/12639 and WO 94/12638 can be co-administered 
with the polypeptides of the present invention. 

25 In addition, it is envisioned that in vitro uses would 

include the ability to stimulate bone marrow and blood cell 
activation and growth before the expanded cells are infused 
into patients 
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RriPf DPscrintion of the Figures 

Figure 1 schematically illustrates the sequence 
5 rearrangement of a protein. The N-terminus (N) and the C- 
terminus (C) of the native protein are joined through a 
linker, or joined directly. The protein is opened at a 
breakpoint creating a new N-terminus (new N) and a new C- 
terminus <new-C) resulting in a protein with a new linear 
10 amino acid sequence. A rearranged molecule may be 

synthesized de novo as linear molecule and not go through 
the steps of joining the original N-terminus and the C- 
terminus and opening of the protein at the breakpoint. 

15 Figure 2 shows a schematic of Method I, for creating 

new proteins in which the original N-terminus and C-terminus 
of the native protein are joined with a linker and different 
N-terminus and C-terminus of the protein are created. In the 
example shown the sequence rearrangement results in a new 

20 gene encoding a protein with a new N-terminus created at 
amino acid 97 of the original protein, the original C- 
terminus (a. a. 174) joined to the amino acid 11 (a. a. 1- 10 
are deleted) through a linker region and a new C-terminus 
created at amino acid 96 of the original sequence. 

25 

Figure 3 shows a schematic of Method II, for creating 
new proteins in which the original N-terminus and C-terminus 
of the native protein are joined without a linker and 
different N-terminus and C-terminus of the protein are 

30 created. In the example shown the sequence rearrangement 
results in a new gene encoding a protein with a new N- 
terminus created at amino acid 97 of the original protein, 
the original C-terminus (a. a. 174) joined to the original N- 
terminus and a new C-terminus created at amino acid 96 of 

35 the original sequence. 
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Figure 4 shows a schematic of Method III, for creating 
new proteins in which the original N-terminus and C-terminus 
of the native protein are joined with a linker and different 
5 N-terminus and C-terminus of the protein are created. In the 
example shown the sequence rearrangement results in a new 
gene encoding a protein with a new N-terminus created at 
amino acid 97 of the original protein, the original C- 
terminus (a. a. 174) joined to amino acid 1 through a linker 
10 region and a new C-terminus created at amino acid 96 of the 
original sequence. 
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Detailed Description of the Invention 

Receptor agonists of the present invention may be 
5 useful in the treatment of diseases characterized by 
decreased levels of granulocytes of the hematopoietic 
system. 

A G-CSF receptor agonist may be useful in the treatment 
or prevention of neutropenia. Many drugs may cause bone 

10 marrow suppression or hematopoietic deficiencies. Examples 
of such drugs are AZT, DDI, alkylating agents and anti- 
metabolites used in chemotherapy, antibiotics such as 
chloramphenicol, penicillin, gancyclovir, daunomycin and 
sulfa drugs, phenothiazones , tranquilizers such as 

15 meprobamate, analgesics such as aminopyrine and dipyrone, 
anti-convulsants such as phenytoin or carbamazepine, 
antithyroids such as propylthiouracil and methimazole and 
diuretics. G-CSF receptor agonists may be useful in 
preventing or treating the bone marrow suppression or 

20 hematopoietic deficiencies which often occur in patients 
treated with these drugs. 

Hematopoietic deficiencies may also occur as a result 
of viral, microbial or parasitic infections and as a result 
of treatment for renal disease or renal failure, e.g., 

25 dialysis. The present peptide may be useful in treating 
such hematopoietic deficiency. 

Another aspect of the present invention provides 
plasmid DNA vectors for use in the method of expression of 
these novel G-CSF receptor agonists. These vectors contain 

30 the novel DNA sequences described above which code for the 
novel polypeptides of the invention. Appropriate vectors 
which can transform host cells capable of expressing the G- 
CSF receptor agonists include expression vectors comprising 
nucleotide sequences coding for the G-CSF receptor agonists 

35 joined to transcriptional and translational regulatory 
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sequences which are selected according to the host cells 
used. Vectors incorporating modified sequences as described 
above are included in the present invention and are useful 
in the production of the modified G-CSF receptor agonist 
5 polypeptides. The vector employed in the method also 
contains selected regulatory sequences in operative 
association with the DNA coding sequences of the invention 
and capable of directing the replication and expression 
thereof in selected host cells. 

10 As another aspect of the present invention, there is 

provided a novel method for producing the novel family of 
human G-CSF receptor agonists. The method of the present 
invention involves culturing suitable cells or cell line, 
which has been transformed with a vector containing a DNA 

15 sequence coding for expression of the novel G-CSF receptor 
agonist polypeptide. Suitable cells or cell lines may 
include various strains of bacteria such as E. coli, yeast, 
mammalian cells, or insect cells may be utilized as host 
cells in the method of the present invention. 

20 

Other aspects of the present invention are methods and 
therapeutic compositions for treating the conditions 
referred to above. Such compositions comprise a 
therapeutically effective amount of one or more of the G-CSF 

25 receptor agonists of the present invention in a mixture with 
a pharmaceutical^ acceptable carrier. This composition can 
be administered either parent erally, intravenously or 
subcutaneously . When administered, the therapeutic 
composition for use in this invention is preferably in the 

30 form of a pyrogen-f ree, parenterally acceptable aqueous 

solution. The preparation of such a parenterally acceptable 
protein solution, having due regard to pH, isotonicity, 
stability and the like, is within the skill of the art. 
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The dosage regimen involved in a metnod for treating 
the above-described conditions will be determined by the 
attending physician considering various factors which modify 
the action of drugs, e.g. the condition, body weight, sex 
5 and diet of the patient, the severity of any infection, time 
of administration and other clinical factors. Generally, a 
daily regimen may be in the range of 0.5 - 150 |ig/kg of non- 
glycosylated G-CSF receptor agonists protein per kilogram of 
body weight. Dosages would be adjusted relative to the 

10 activity of a given receptor agonist and it would not be 

unreasonable to note that dosage regimens may include doses 
as low as 0.1 microgram and as high as 1 milligram per 
kilogram of body weight per day. In addition, there may 
exist specific circumstances where dosages of G-CSF receptor 

15 agonist would be adjusted higher or lower than the range of 
0.5 - 150 micrograms per kilogram of body weight. These 
include co-administration with other CSF or growth factors; 
co-administration with chemotherapeutic drugs and/or 
radiation; the use of glycosylated G-CSF receptor agonists; 

20 and various patient-related issues mentioned earlier in this 
section. As indicated above, the therapeutic method and 
compositions may also include co-administration with other 
human factors. A non-exclusive list of other appropriate 
hematopoietins, CSFs and interleukins for simultaneous or 

25 serial co-administration with the polypeptides of the 

present invention includes GM-CSF, c-mpl ligand (also known 
as TPO or MGDF) , M-CSF, erythropoietin (EPO) , IL-1, IL-4, 
IL-2, IL-3, IL-5, IL 6, IL-7, IL-8, IL-9, IL-10, IL-11, IL- 
12, IL-13, IL-15, LIF, flt3/flk2 ligand, human growth 

30 hormone, B-cell growth factor, B-cell differentiation 

factor, eosinophil differentiation factor and stem cell 
factor (SCF) also known as steel factor or c-kit ligand 
(herein collectively referred to as "colony stimulating 
factors'), or combinations thereof. In addition to the list 

35 above, IL-3 variants taught in WO 94/12639 and WO 94/12638 
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can be co-administered with the polypeptides of the present 
invention. 

The G-CSF receptor agonists of the present invention 
5 may be useful in the mobilization of hematopoietic 

progenitors and stem cells in peripheral blood. Peripheral 
blood derived progenitors have been shown to be effective in 
reconstituting patients in the setting of autologous marrow 
transplantation. Hematopoietic growth faccors, including G- 

10 CSF and GM-CSF, have been shown to enhance the number of 
circulating progenitors and stem cells in the peripheral 
blood. This has simplified the procedure for peripheral stem 
cell collection and dramatically decreased the cost of the 
procedure by decreasing the number of pheresis required. The 

15 G-CSF receptor agonist of the present invention may be 

useful in mobilization of stem cells and further enhance the 
efficacy of peripheral stem cell transplantation. 

The G-CSF receptor agonists of the present invention 

20 may also be useful in the ex vivo expansion of hematopoietic 
progenitors. Colony stimulating factors <CSFs), such as G- 
CSF, have been administered alone, co-administered with 
other CSFs, or in combination with bone marrow transplants 
subsequent to high dose chemotherapy to treat the 

25 neutropenia and which is often the result of such treatment. 
However the period of severe neutropenia may not be totally 
eliminated. The myeloid lineage, which is comprised of 
monocytes (macrophages), granulocytes (including 
neutrophils) and megakaryocytes, is critical in preventing 

30 infections and bleeding which can be life-threatening. 

Neutropenia may also be the result of disease, genetic 
disorders, drugs, toxins, radiation and many therapeutic 
treatments such as conventional oncology therapy. 

Bone marrow transplants have been used to treat this 

35 patient population. However, several problems are associated 
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with the use of bone marrow to reconstitute a compromised 
hematopoietic system including: 1) the number of stem cells 
in bone marrow or other tissues, such as spleen or 
peripheral blood, is limited, 2) Graft Versus Host Disease, 
5 3) graft rejection and 4) possible contamination with tumor 
cells. Stem cells and progenitor cells make up a very small 
percentage of the nucleated cells in the bone marrow, spleen 
and peripheral blood. It is clear that a dose response 
exists such that a greater number of multipotential 

10 hematopoietic progenitors will enhance hematopoietic 

recovery. Therefore, the in vitro expansion of stem cells 
should enhance hematopoietic recovery and patient survival. 
Bone marrow from an allogeneic donor has been used to 
provide bone marrow for transplant. However, Graft Versus 

15 Host Disease and graft rejection limit bone marrow 

transplantation even in recipients with HLA-matched sibling 
donors. An alternative to allogeneic bone marrow transplants 
is autologous bone marrow transplants. In autologous bone 
marrow transplants, some of the patient's own marrow is 

20 harvested prior to myeloablative therapy, e.g. high dose 
chemotherapy, and is transplanted back into the patient 
afterwards. Autologous transplants eliminate the risk of 
Graft Versus Host Disease and graft rejection. However, 
autologous bone marrow transplants still present problems in 

25 terms of the limited number of stems cells in the marrow and 
possible contamination with tumor cells. The limited number 
of multipotential hematopoietic progenitors may be overcome 
by ex-vivo expansion of the multipotential hematopoietic 
progenitors. In addition, stem cells can be specifically 

30 isolated based on the presence of specific surface antigens 
such as CD34+ in order to decrease tumor cell contamination 
of the marrow graft. 

The fallowing patents contain further details on 
35 separating stem cells, CD34+ cells, culturing the cells with 
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hematopoietic factors, the use of the cells for the 
treatment of patients with hematopoietic disorders and the 
use of hematopoietic factors for cell expansion and gene 
therapy . 

5 

5,061,620 relates to compositions comprising human 
hematopoietic stem cells provided by separating the stem 
cells from dedicated cells. 

10 5,199,942 describes a method for autologous hematopoietic 

cell transplantation comprising: (1) obtaining hematopoietic 
progenitor cells from a patient; (2) ex-vivo expansion of 
cells with a growth factor selected from the group 
consisting of IL-3, flt3 ligand, c-kit ligand, GM CSF, IL-1, 

15 GM-CSF/IL-3 fusion protein and combinations thereof; (3) 
administering cellular preparation to a patient. 

5,240,856 relates to a cell separator that includes an 
apparatus for automatically controlling the cell separation 
20 process. 

WO 91/16116 describes devices and methods for selectively 
isolating and separating target cells from a mixture of 
cells . 

25 

WO 91/18972 describes methods for in vitro culturing of bone 
marrow, by incubating suspension of bone marrow cells, using 
a hollow fiber bioreactor. 

30 WO 92/18615 relates to a process for maintaining and 

expanding bone marrow cells, in a culture medium containing 
specific mixtures of cytokines, for use in transplants. 

WO 93/08268 describes a method for selectively expanding 
35 stem cells, comprising the steps of (a) separating CD34+ 
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stem cells from other cells and (b) incubating the separated 
cells in a selective medium, such that the stem cells are 
selectively expanded. 

5 WO 93/18136 describes a process for in vitro support of 

mammalian cells derived from peripheral blood, 

WO 93/18648 relates to a composition comprising human 
neutrophil precursor cells with a high content of 
10 myeloblasts and promyelocytes for treating genetic or 
acquired neutropenia. 

WO 94/08039 describes a method of enrichment for human 
hematopoietic stem cells by selection for cells which 
15 express c-kit protein. 

WO 94/11493 describes a stem cell population that are CD34+ 
and small in size, which are isolated using a counterflow 
elutriation method. 

20 

WO 94/27698 relates to a method combining immunoaf f inity 
separation and continuous flow centrifugal separation for 
the selective separation of a nucleated heterogeneous cell 
population from a heterogeneous cell mixture. 

25 

WO 94/25848 describes a cell separation apparatus for 
collection and manipulation of target cells. 

The long term culturing of highly enriched CD34+ precursors 
30 of hematopoietic progenitor cells from human bone marrow in 
cultures containing IL-la, IL-3, IL-6 or GM-CSF is 

discussed in Brandt et al (J. Clin. Invest. 86:932-941, 
1990) . 
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One aspect of the present invention provides a method 
for selective ex-vivo expansion of stem cells. The term 
"stem cell" refers to the multipotential hematopoietic cells 
as well as early myeloid progenitor and precursors cells 
5 which can be isolated from bone marrow, spleen or peripheral 
blood. The term -expansion 11 refers to the proliferation and 
differentiation of the cells. The present invention provides 
a method for selective ex-vivo expansion of stem cells, 
comprising the steps of; (a) separating stem cells from 

10 other cells, (b) culturing the separated stem cells with a 

selective medium which contains a G-CSF receptor agonist and 
optionally a second colony stimulating factor, and (c) 
harvesting the cultured stems cells. Stem cells, as well as 
committed progenitor cells destined to become neutrophils, 

15 erythrocytes, platelets, etc., may be distinguished from 
most other cells by the presence or absence of particular 
progenitor marker antigens, such as CD34, that are present 
on the surface of these cells and/or by morphological 
characteristics. The phenotype for a highly enriched human 

20 stem cell fraction is reported as CD34+, Thy-1+ and lin-, 

but it is to be understood that the present invention is not 
limited to the expansion of this stem cell population. The 
CD34+ enriched human stem cell fraction can be separated by 
a number of reported methods, including affinity columns or 

25 beads, magnetic beads or flow cytometry using antibodies 
directed to surface antigens such as the CD34+. Further, 
physical separation methods such as counterflow elutriation 
may be used to enrich hematopoietic progenitors. The CD34+ 
progenitors are heterogeneous, and may be divided into 

30 several sub-populations characterized by the presence or 
absence of co-expression of different lineage associated 
cell surface associated molecules. The most immature 
progenitor cells do not express any known lineage associated 
markers, such as HLA-DR or CD38, but they may express 

35 CD90(thy-l). Other surface antigens such hs CD33, CD38, 
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CD41, CD71, HLA-DR or c-kit can also be used to selectively 
isolate hematopoietic progenitors. The separated cells can 
be incubated in selected medium in a culture flask, sterile 
bag or in hollow fibers. Various colony stimulating factors 
5 may be utilized in order to selectively expand cells. 

Representative factors that have been utilized for ex-vivo 
expansion of bone marrow include, c-kit ligand, IL-3, G-CSF, 
GM-CSF, IL-1, IL-6, IL-11, flt-3 ligand or combinations 
thereof. The proliferation of the stem cells can be 
10 monitored by enumerating the number of stem cells and other 
cells, by standard techniques (e.g. hemacytometer, CFU, 
LTCIC) or by flow cytometry prior and subsequent to 
incubation. 

15 Several methods for ex-vivo expansion of stem cells 

have been reported utilizing a number of selection methods 
and expansion using various colony stimulating factors 
including c-kit ligand (Brandt et al., Blood 83:1507-1514, 
1994; McKenna et al., Blood 86 : 3413-3420 , 1995), IL-3 
0 (Brandt et al . , Blood 83:1507-1514, 1994; Sato et al., Blood 
82:3600-3609, 1993), G-CSF (Sato et al., Blood 82:3600-3609, 
1993), GM-CSF (Sato et al., Blood 82:3600-3609, 1993), IL-1 
(Muench et al . , Blood 81:3463-3473, 1993), IL-6 (Sato et 
al., Blood 82:3600-3609, 1993), IL-11 (Lemoli et al . , Exp. 

25 Hem. 21:1668-1672, 1993; Sato et al., Blood 82:3600-3609, 
1993), flt-3 ligand (McKenna et al., Blood 86:3413 3420, 
1995) and/or combinations thereof (Brandt et al., Blood 
83:1507 1514, 1994; Haylock et al., Blood 80:1405-1412, 
1992, Roller et al., Biotechnology 11:358-363, 1993; Lemoli 

30 et al., Exp. Hem. 21:1668-1672, 1993), McKenna et al., Blood 
86:3413-3420, 1995; Muench et al., Blood 81:3463-3473, 1993; 
Patchen et al., Biotherapy 7:13-26, 1994; Sato et al., Blood 
82:3600-3609, 1993; Smith et al . , Exp. Hem. 21:870-877, 
1993; Steen et al., Stem Cells 12:214-224, 1994; Tsujino et 

35 al., Exp. Hem. 21:1379-1386, 1993). Among the individual 
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colony stimulating factors, hIL-3 has been shown to be one 
of the most potent in expanding peripheral blood CD34+ cells 
(Sato et al. f Blood 82:3600-3609, 1993; Kobayashi et al., 
Blood 73:1836-1841, 1989). However, no single factor has 
5 been shown to be as effective as the combination of multiple 
factors. The present invention provides methods for ex vivo 
expansion that utilize novel G-CSF receptor agonists. 

Another aspect of the invention provides methods of 
10 sustaining and/or expanding hematopoietic precursor cells 
which includes inoculating the cells into a culture vessel 
which contains a culture medium that has been conditioned by 
exposure to a stromal cell line such as HS-5 (WO 96/02662, 
Roecklein and Torok-Strob, Blood 85:997-1105, 1995) that has 
15 been supplemented with a G-CSF receptor agonist of the 
present invention. 

Another projected clinical use of growth factors has 
been in the in vitro activation of hematopoietic progenitors 

20 and stem cells for gene therapy. Due to the long life-span 
of hematopoietic progenitor cells and the distribution of 
their daughter cells throughout the entire body, 
hematopoietic progenitor cells are good candidates for ex 
vivo gene transf ection. In order to have the gene of 

2 5 interest incorporated into the genome of the hematopoietic 

progenitor or stem cell one needs to stimulate cell division 
and DNA replication. Hematopoietic stem cells cycle at a 
very low frequency which means that growth factors may be 
useful to promote gene transduction and thereby enhance the 

30 clinical prospects for gene therapy. Potential applications 
of gene therapy (review Crystal, Science 270:404-410, 1995) 
include; 1) the treatment of many congenital metabolic 
disorders and immunodeficiencies (Kay and Woo, Trends Genet. 
10:253-257, 1994), 2) neurological disorders (Friedmann, 

35 Trends Genet. 10:210-214, 1994), 3) cancer (Culver and 
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Blaese, Trends Genet. 10:174-178, 1994) and 4) infectious 
diseases (Gilboa and Smith, Trends Genet. 10:139-144, 1994). 

There are a variety of methods, known to those with 
skill in the art, for introducing genetic material into a 
5 host cell. A number of vectors, both viral and non-viral 

have been developed for transferring therapeutic genes into 
primary cells. Viral based vectors include; 1) replication 
deficient recombinant retrovirus (Boris-Lawrie and Temin, 
Curr. Opin. Genet. Dev. 3:102-109, 1993; Boris-Lawrie and 

10 Temin, Annal . New York Acad. Sci. 716:59-71, 1994; Miller, 
Current Top. Microbiol. Immunol. 158:1-24, 1992) and 
replication-deficient recombinant adenovirus (Berkner, 
BioTechniques 6:616-629, 1988; Berkner, Current Top. 
Microbiol. Immunol. 158:39-66, 1992; Brody and Crystal, 

15 Annal. New York Acad. Sci. 716:90-103, 1994). Non-viral 

based vectors include protein/DNA complexes (Cristiano et 
al., PNAS USA. 90:2122-2126, 1993; Curiel et al . , PNAS USA 
88:8850-8854, 1991; Curiel, Anna 1 . New York Acad. Sci. 
716:36-58, 1994), electroporation and liposome mediated 

20 delivery such as cationic liposomes (Farhood et al., Annai. 
New York Acad. Sci. 716:23-35, 1994). 

The present invention provides an improvement to the 
existing methods of expanding hematopoietic cells, into 
which new genetic material has been introduced, in that it 

25 provides methods utilizing G-CSF receptor agonists that may 
have improved biological activity and/or physical 
properties . 



30 Determination of the Linker 

The length of the amino acid sequence of the linker 
can be selected empirically or with guidance from structural 
information, or by using a combination of the two 
35 approaches. 
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When no structural information is available, a small 
series of linkers can be prepared for testing using a design 
whose length is varied in order to span a range from 0 to 50 
A and whose sequence is chosen in order to be consistent 
5 with surface exposure (hydrophilicity , Hopp & Woods, Mol. 

Immunol. 20: 483-489, 1983; Kyte & Doolittle, J. Mol . Biol. 
157:105-132, 1982; solvent exposed surface area, Lee & 
Richards, J. Mol . Biol. 55:379-400, 1971) and the ability to 
adopt the necessary conformation without deranging the 

10 configuration of the c-mpl receptor agonist 

(conformational^ flexible; Karplus & Schulz, 
Naturwissenschaften 72:212-213, (1985), Assuming an average 
of translation of 2.0 to 3.8 A per residue, this would mean 
the length to test would be between 0 to 30 residues, with 0 

15 to 15 residues being the preferred range. Exemplary of such 
an empirical series would be to construct linkers using a 
cassette sequence such as Gly-Gly-Gly-Ser (SEQ ID N0:2) 
repeated n times, where n is 1, 2, 3 or 4. Those skilled in 
the art will recognize that there are many such sequences 

20 that vary in length or composition that can serve as linkers 

with the primary consideration being that they be neither 
excessively long nor short (cf., Sandhu, Critical Rev. 
Biotech. 12: 437-462, 1992); if they are too long, entropy 
effects will likely destabilize the three-dimensional fold, 

25 and may also make folding kinetically impractical, and if 
they are too short, they will likely destabilize the 
molecule because of torsional or steric strain. 

Those skilled in the analysis of protein structural 
30 information will recognize that using the distance between 
the chain ends, defined as the distance between the c-alpha 
carbons, can be used to define the length of the sequence to 
be used, or at least to limit the number of possibilities 
that must be tested in an empirical selection of linkers. 
3 5 They will .*lso recognize that it is sometimes the case that 



WO 97/1 2977 PCT7US96/1 5935 

27 



the positions of the ends of the polypeptide chain are ill- 
defined in structural models derived from x-ray diffraction 
or nuclear magnetic resonance spectroscopy data, and that 
when true, this situation will therefore need to be taken 
5 into account in order to properly estimate the length of the 
linker required. From those residues whose positions are 
well defined are selected two residues that are close in 
sequence to the chain ends, and the distance between their 
c-alpha carbons is used to calculate an approximate length 

10 for a linker between them. Using the calculated length as a 
guide, linkers with a range of number of residues 
(calculated using 2 to 3.8A per residue) are then selected. 
These linkers may be composed of the original sequence, 
shortened or lengthened as necessary, and when lengthened 

15 the additional residues may be chosen to be flexible and 

hydrophilic as described above; or optionally the original 
sequence may be substituted for using a series of linkers, 
one example being the Gly-Gly-Gly-Ser (SEQ ID NO: 2) cassette 
approach mentioned above; or optionally a combination of the 

20 original sequence and new sequence having the appropriate 
total length may be used. 

ngt-Prmination of the Amino and Carboxvl Termini of G-CSF 

25 Bficentflj: Agonists 

Sequences of G-CSF receptor agonists capable of 
folding to biologically active states can be prepared by 
appropriate selection of the beginning (amino terminus) and 

30 ending (carboxyl terminus) positions from within the 

original polypeptide chain while using the linker sequence 
as described above. Amino and carboxyl termini are selected 
from within a common stretch of sequence, referred to as a 
breakpoint region, using the guidelines described below. A 

35 novel amino acid sequence is thus generated by selecting 
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amino and carboxyl termini from within the same breakpoint 
region. l:i many cases the selection of the new termini will 
be such that the original position of the carboxyl terminus 
immediately preceded that of the amino terminus. However, 
5 those skilled in the art will recognize that selections of 
termini anywhere within the region may function, and that 
these will effectively lead to either deletions or additions 
to the amino or carboxyl portions of the new sequence. 

It is a central tenet of molecular biology that the 

10 primary amino acid sequence of a protein dictates folding to 
the three-dimensional structure necessary for expression of 
its biological function. Methods are known to those skilled 
in the art to obtain and interpret three-dimensional 
structural information using x-ray diffraction of single 

15 protein crystals or nuclear magnetic resonance spectroscopy 
of protein solutions. Examples of structural information 
that are relevant to the identification of breakpoint 
regions include the location and type of protein secondary 
structure (alpha and 3-10 helices, parallel and anti- 

20 parallel beta sheets, chain reversals and turns, and loops; 

Kabsch & Sander, Biopolymers 22: 2577-2637, 1983; the degree 
of solvent exposure of amino acid residues, the extent and 
type of interactions of residues with one another (Chothia, 
Ann. Rev. Biochem. 53:537-572; 1984) and the static and 

25 dynamic distribution of conformations along the polypeptide 
chain (Alber & Mathews, Methods Enzymol . 154: 511-533, 
1987). In some cases additional information is known about 
solvent exposure of residues; one example is a site of post- 
translational attachment of carbohydrate which is 

30 necessarily on the surface of the protein. When 

experimental structural information is not available, or is 
not feasible to obtain, methods are also available to 
analyze the primary amino acid sequence in order to make 
predictions of protein tertiary and secondary structure, 

35 solvent accessibility and the occurrence of turns and loops. 
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Biochemical methods are also sometimes applicable for 
empirically determining surface exposure when direct 
structural methods are not feasible; for example, using the 
identification of sites of chain scission following limited 
5 proteolysis in order to infer surface exposure (Gentile & 
Salvatore, Eur. J. Biochem. 218:603-621, 1993) 
Thus using either the experimentally derived structural 
information or predictive methods (e.g., Srinivisan & Rose. 
Proteins: Struct., Funct. & Genetics, 22: 81-99, 1995) the 

10 parental amino acid sequence is inspected to classify 

regions according to whether or not they are integral to the 
maintenance of secondary and tertiary structure. The 
occurrence of sequences within regions that are known to be 
involved in periodic secondary structure (alpha and 3-10 

15 helices, parallel and anti-parallel beta sheets) are regions 
that should be avoided. Similarly, regions of amino acid 
sequence that are observed or predicted to have a low degree 
of solvent exposure are more likely to be part of the so- 
called hydrophobic core of the protein and should also be 

20 avoided for selection of amino and carboxyl termini. In 

contrast, those regions that are known or predicted to be in 
surface turns or loops, and especially those regions that 
are known not to be required for biological activity, are 
the preferred sites for location of the extremes of the 

25 polypeptide chain. Continuous stretches of amino acid 

sequence that are preferred based on the above criteria are 
referred to as a breakpoint region. 
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TABLE 1 
QLTflONTTCTjEOTTDRc: 



10 



20 



25 



30 



40 



45 



50 



L-llstart . seq 



L-llstop . seq 



BlstartP. seq 
15 blstopP.seq 
39start .seq 



38stop.Seq 



97start .seq 



96stop.Seq 



126start . seq 



125scop.Seq 



35 133start.seq 



132stop.seq 



142start .seq 



141stop.Seq 



96for .Seq 



9 6 rev. Seq 



GCTCTGAGAG CCGCCAGAGC CGCCAGAGGG 
CTGCGCAAGG TGGCGTAGAA CGCG 
(SEQ ID NO: 3) 

CAGCCCTCTG GCGGCTCTGG CGGCTCTCAG 
AGCTTCCTGC TCAAGTCTTT AGAG 
(SEQ ID NO: 4) 

GGGCTGCGCA AGGTGGCG (SEQ ID NO: 5) 

ACACCATTGG GCCCTGCCAG C (SEQ ID NO: 6) 

GATCGACCAT GGCTTACAAG CTGTGCCACC CC 
(SEQ ID NO:7) 

CGATCGAAGC TTATTAGGTG GCACACAGCT TCTCCT 
(SEQ ID NO:8) 

GATCGACCAT GGCTCCCGAG TTGGGTCCCA CC 
(SEQ ID NO: 9) 

CGATCGAAGC TTATTAGGAT ATCCCTTCCA GGGCCT 
(SEQ ID NO: 10) 

GATCGACCAT GGCTATGGCC CCTGCCCTGC AG 
(SEQ ID NO:ll) 

CGATCGAAGC TTATTATCCC AGTTCTTCCA TCTGCT 
(SEQ ID NO:12) 

GATCGACCAT GGCTACCCAG GGTGCCATGC CG 
(SEQ ID NO: 13) 

CGATCGAAGC TTATTAGGGC TGCAGGGCAG GGGCCA 
(SEQ ID NO: 14) 

GATCGACCAT GGCTTCTGCT TTCCAGCGCC GG 
(SEQ ID NO: 15) 

CGATCGAAGC TTATTAGGCG AAGGCCGGCA TGGCAC 
(SEQ ID NO: 16) 

ATATCCATGG CTCCGGAACT GGGTCCAACT CTG 
(SEQ ID NO: 17) 

ACCTCCAGGA AGCTCTGCAG ATGG 
(SEQ ID NO: 18) 
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31 



10 



15 



20 



25 



35 



40 



45 



50 



125for.seq 



125rev. seq 



132for .seq 



132rev.seq 



141f or . seq 



141rev.seq 



49start . seq 



48stop.seq 



30 77start.seq 



76stop. seq 



82start . seq 



81stop.seq 



84start . seq 



83stop.seq 



91start . seq 



90stop .seq 



TATATCCATG GCTATGGCTC CAGCTCTGCA 
ACCAACTCAA GGTGCAATGC CAGCATTTGC ATCTG 
(SEQ ID NO: 19) 

GATGGCTAGC AACCAGAACA CCACCTGCAC 
GACGTTGAAA AGCAGATGCA AATGCTGGCA TTG 
(SEQ ID NO:20) 

TATATCCATG GCTACTCAAG GTGCTATGCC 
AGCTTTTGCT TCTGCTTTTC AACGTCG 
(SEQ ID NO:21) 

GCAGATGGCT AGCAACCAGA ACACCACCTG 
CACGACGTTG AAAAGCAGAA GCAAAAGC 
(SEQ ID NO: 22) 

CATGGCTTCT GCTTTTCAAC GTCGTGCAGG 
TGGTGTTCTG GTTG (SEQ ID NO: 23) 

CTAGCAACCA GAACACCACC TGCACGACGT 
TGAAAAGCAG AAGC (SEQ ID NO: 24) 

GATCGACCAT GGCTCTGCTC GGACACTCTC TG 
(SEQ ID NO:68) 

CGATCGAAGC TTATTACACC AGCTCCTCGG GGTGGC 
(SEQ ID NO: 69) 

GATCGACCAT GGCTCAACTC CATAGCGGCC TT 
(SEQ ID NO: 70) 

CGATCGAAGC TTATTAGCTC AAGCAGCCTG CCAGCT 
(SEQ ID NO:71) 

GATCGACCAT GGCTCTTTTC CTCTACCAGG GG 
(SEQ ID NO: 72) 

CGATCGAAGC TTATTAGCCG CTATGGAGTT GGCTCA 
(SEQ ID NO:73) 

GATCGACCAT GGCTCTCTAC CAGGGGCTCC TG 
(SEQ ID NO:74) 

CGATCGAAGC TTATTAGAAA AGGCCGCTAT GGAGTT 
(SEQ ID NO:75) 

GATCGACCAT GGCTGCCCTG GAAGGGATAT CC 
(SEQ ID NO:76) 

CGATCGAAGC TTATTACTGC AGGAGCCCCT GGTAGA 
(SEQ ID NO:77) 
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10 



15 



20 



25 



112start . seq 



lllstop . seq 



117start . seq 



116stop.seq 



119start . seq 



118stop.seq 



Z4849at .for 



Z4849at .rev 



GATCGACCAT 
(SEQ ID NO: 

CGATCGAAGC 
(SEQ ID NO: 

GATCGACCAT 
(SEQ ID NO: 

CGATCGAAGC 
(SEQ ID NO: 

GATCGACCAT 
(SEQ ID NO: 

CGATCGAAGC 
(SEQ ID NO: 

CATGGCTTTG 
TCCATGGGCT 

CAGAGGAGCC 
TCCTAACAAA 



GGCTGACTTT 
78) 

TTATTAGGCG 
79) 

GGCTATCTGG 
80) 

TTATTAGGTG 
81) 

GGCTCAGCAG 
82) 

TTATTACCAG 
83) 

TTAGGACATT 
CCTCTGAGCT 

CATGGAATAC 
GC (SEQ ID 



GCCACCACCA TC 



ACGTCCAGCT GCAGTG 



CAGCAGATGG AA 



GTGGCAAAGT CGGCGA 



ATGGAAGAAC TG 



ATGGTGGTGG CAAAGT 



CTTTAGGTAT 
(SEQ ID NO: 84) 

CTAAAGAATG 
NO:85) 



WO 97/1 2977 PCT/US96/1 5935 

33 



TABLE 2 

f)NA sequences 



pMON3485.Seq 



1 ATGGCTTACA AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC 

51 TCTGGGCATC CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC 

101 AGCTGGCAGG CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG 

10 151 GGGCTCCTGC AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT 

201 GGACACACTG CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC 

251 AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC 

301 ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC CGGGCAGGAG GGGTCCTGGT 

351 TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT GTCGTACCGC GTTCTACGCC 

15 401 ACCTTGCGCA GCCCTCTGGC GGCTCTGGCG GCTCTCAGAG CTTCCTGCTC 

451 AAGTCTTTAG AGCAAGTGAG GAAGATCCAG GGCGATGGCG CAGCGCTCCA 

501 GGAGAAGCTG TGTGCCACCT AATAA (SEQ ID NO: 25) 

20 pMON3486.Seq 

1 ATGGCTCCCG AGTTGGGTCC CACCTTGGAC ACACTGCAGC TGGACGTCGC 

51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

25 151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

201 GGAGG 7GTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC TCTGGCGGCT 

251 CTGGCGGCTC TCAGAGCTTC CTGCTCAAGT CTTTAGAGCA AGTGAGGAAG 

301 ATCCAGGGCG ATGGCGCAGC GCTCCAGGAG AAGCTGTGTG CCACCTACAA 

351 GCTGTGCCAC CCCGAGGAGC TGGTGCTGCT CGGACACTCT CTGGGCATCC 

30 401 CCTGGGCTCC CCTGAGCTCC TGCCCCAGCC AGGCCCTGCA GCTGGCAGGC 

451 TGCTTGAGCC AACTCCATAG CGGCCTTTTC CTCTACCAGG GGCTCCTGCA 

501 GGCCCTGGAA GGGATATCCT AATAA (SEQ ID NO: 26) 

35 pMON3487.Seq 

1 ATGGCTATGG CCCCTGCCCT GCAGCCCACC CAGGGTGCCA TGCCGGCCTT 

51 CGCCTCTGCT TTCCAGCGCC GGGCAGGAGG GGTCCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

40 151 CCCTCTGGCG GCTCTGGCGG CTCTCAGAGC TTCCTGCTCA AGTCTTTAGA 

201 GCAAGTGAGG AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT 

251 GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGCT GCTCGGACAC 

301 TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT 

351 GCAGCTGGCA GGCTGCTTGA GCCAACTCCA TAGCGGCCTT TTCCTCTACC 

45 401 AGGGGCTCCT GCAGGCCCTG GAAGGGATAT CCCCCGAGTT GGGTCCCACC 

451 TTGGACACAC TGCAGCTGGA CGTCGCCGAC TTTGCCACCA CCATCTGGCA 

501 GCAGATGGAA GAACTGGGAT AATAA (SEQ ID NO: 27) 



50 



pMON3488.Seq 



WO 97/12977 
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34 



10 



1 ATGGCTACCC 

51 GGCAGGAGGG 

101 CGTACCGCGT 

151 TCTCAGAGCT 

201 CGATGGCGCA 

251 ACCCCGAGGA 

301 CCCCTGAGCT 

351 CCAACTCCAT 

401 AAGGGATATC 

451 GTCGCCGACT 

501 GGCCCCTGCC 



AGGGTGCCAT 
GTCCTGGTTG 
TCTACGCCAC 
TCCTGCTCAA 
GCGCTCCAGG 
GCTGGTGCTG 
CCTGCCCCAG 
AGCGGCCTTT 
CCCCGAGTTG 
TTGCCACCAC 
CTGCAGCCCT 



GCCGGCCTTC 
CTAGCCATCT 
CTTGCGCAGC 
GTCTTTAGAG 
AGAAGCTGTG 
CTCGGACACT 
CCAGGCCCTG 
TCCTCTACCA 
GGTCCCACCT 
CATCTGGCAG 
AATAA (SEQ 



GCCTCTGCTT 
GCAGAGCTTC 
CCTCTGGCGG 
CAAGTGAGGA 
TGCCACCTAC 
CTCTGGGCAT 
CAGCTGGCAG 
GGGGCTCCTG 
TGGACACACT 
CAGATGGAAG 
ID NO: 28) 



TCCAGCGCCG 
CTGGAGGTGT 
CTCTGGCGGC 
AGATCCAGGG 
AAGCTGTGCC 
CCCCTGGGCT 
GCTGCTTGAG 
CAGGCCCTGG 
GCAGCTGGAC 
AACTGGGAAT 



15 



20 



25 



pMON3489.Seq 

1 ATGGCTTCTG 

51 TCTGCAGAGC 

101 AGCCCTCTGG 

151 GAGCAAGTGA 

201 GTGTGCCACC 

251 ACTCTCTGGG 

301 CTGCAGCTGG 

351 CCAGGGGCTC 

401 CCTTGGACAC 

451 CAGCAGATGG 

501 TGCCATGCCG 



CTTTCCAGCG 
TTCCTGGAGG 
CGGCTCTGGC 
GGAAGATCCA 
TACAAGCTGT 
CATCCCCTGG 
CAGGCTGCTT 
CTGCAGGCCC 
ACTGCAGCTG 
AAGAACTGGG 
GCCTTCGCCT 



CCGGGCAGGA GGGGTCCTGG 
TGTCGTACCG CGTTCTACGC 
GGCTCTCAGA GCTTCCTGCT 
GGGCGATGGC GCAGCGCTCC 
GCCACCCCGA GGAGCTGGTG 
GCTCCCCTGA GCTCCTGCCC 
GAGCCAACTC CATAGCGGCC 
TGGAAGGGAT ATCCCCCGAG 
GACGTCGCCG ACTTTGCCAC 
AATGGCCCCT GCCCTGCAGC 
AATAA (SEQ ID NO: 29) 



TTGCTAGCCA 
CACCTTGCGC 
CAAGTCTTTA 
AGGAGAAGCT 
CTGCTCGGAC 
CAGCCAGGCC 
TTTTCCTCTA 
TTGGGTCCCA 
CACCATCTGG 
CCACCCAGGG 



30 



35 



40 



pMON3490.seq 

1 ATGGCTTACA 

51 TCTGGGCATC 

101 AGCTGGCAGG 

151 GGGCTCCTGC 

201 GGACACACTG 

251 AGATGGAAGA 

301 ATGCC3GCCT 

351 TGCTAGCCAT 

401 ACCTTGCGCA 

451 TTCCTGCTCA 

501 AGCGCTCCAG 



AGCTGTGCCA 
CCCTGGGCTC 
CTGCTTGAGC 
AGGCCCTGGA 
CAGCTGGACG 
ACTGGGAATG 
TCGCCTCTGC 
CTGCAGAGCT 
GCCCACACCA 
AGTCTTTAGA 
GAGAAGCTGT 



CCCCGAGGAG 
CCCTGAGCTC 
CAACTCCATA 
AGGGATATCC 
TCGCCGACTT 
GCCCCTGCCC 
TTTCCAGCGC 
TCCTGGAGGT 
TTGGGCCCTG 
GCAAGTGAGA 
GTGCCACCTA 



CTGGTGCTGC 
CTGCCC CAGC 
GCGGCCTTTT 
CCCGAGTTGG 
TGCCACCACC 
TGCAGCCCAC 
CGGGCAGGAG 
GTCGTACCGC 
CCAGCTCCCT 
AAGATCCAGG 
ATAA ( SEQ 



TCGGACACTC 
CAGGCCCTGC 
CCTCTACCAG 
GTCCCACCTT 
ATCTGGCAGC 
CCAGGGTGCC 
GGGTCCTGGT 
GTTCTACGCC 
GCCCCAGAGC 
GCGATGGCGC 
ID NO: 30) 



45 



50 



pMON3491.seq 

1 ATGGCTCCCG 

51 CGACTTTGCC 

101 CTGCCCTGCA 

151 CAGCGCCGGG 

201 GGAGGTGTCG 

251 GCCCTGCCAG 

301 GTGAGAAAGA 



AGTTGGGTCC 
ACCACCATCT 
GCCCACCCAG 
CAGGAGGGGT 
TACCGCGTTC 
CTCCCTGCCC 
TCCAGGGCGA 



CACCTTGGAC 
GGCAGCAGAT 
GGTGCCATGC 
CCTGGTTGCT 
TACGCCACCT 
CAGAGCTTCC 
TGGCGCAGCG 



ACACTGCAGC 
GGAAGAACTG 
CGGCCTTCGC 
AGCCATCTGC 
TGCGCAGCCC 
TGCTCAAGTC 
CTCCAGGAGA 



TGGACGTCGC 
GGAATGGCCC 
CTCTGCTTTC 
AGAGCTTCCT 
ACACCATTGG 
TTTAGAGCAA 
AGCTGTGTGC 
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351 CACCTACAAG CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC 

401 TGGGCATCCC CTGGGCTCCC CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG 

451 CTGGCAGGCT GCTTGAGCCA ACTCCATAGC GGCCTTTTCC TCTACCAGGG 

501 GCTCCTGCAG GCCCTGGAAG GGATATCCTA ATAA (SEQ ID NO: 31) 



pMON3492.seq 

1 ATGGCTATGG CCCCTGCCCT GCAGCCCACC CAGGGTGCCA TGCCGGCCTT 

51 CGCCTCTGCT TTCCAGCGCC GGGCAGGAGG GGTCCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

151 CCCACACCAT TGGGCCCTGC CAGCTCCCTG CCCCAGAGCT TCCTGCTCAA 

201 GTCTTTAGAG CAAGTGAGAA AGATCCAGGG CGATGGCGCA GCGCTCCAGG 

251 AGAAGCTGTG TGCCACCTAC AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG 

301 CTCGGACACT CTCTGGGCAT CCCCTGGGCT CCCCTGAGCT CCTGCCCCAG 

351 CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG CCAACTCCAT AGCGGCCTTT 

401 TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 

451 GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC 

501 CATCTGGCAG CAGATGGAAG AACTGGGATA ATAA (SEQ ID NO: 32) 



PMON3493 .seq 

1 ATGGCTACCC AGGGTGCCAT GCCGGCCTTC GCCTCTGCTT TCCAGCGCCG 

51 GGCAG GXGGG GTCCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT 

101 CGTACCGCGT TCTACGCCAC CTTGCGCAGC CCACACCATT GGGCCCTGCC 

151 AGCTCCCTGC CCCAGAGCTT CCTGCTCAAG TCTTTAGAGC AAGTGAGAAA 

201 GATCCAGGGC GATGGCGCAG CGCTCCAGGA GAAGCTGTGT GCCACCTACA 

251 AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 

301 CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG 

351 CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC 

401 AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT GGACACACTG 

451 CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC AGATGGAAGA 

501 ACTGGGAATG GCCCCTGCCC TGCAGCCCTA ATAA (SEQ ID NO: 33) 



pMON3494.seq 

1 ATGGCTTCTG CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA 

51 TCTGCAGAGC TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC 

101 AGCCCACACC ATTGGGCCCT GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC 

151 AAGTCTTTAG AGCAAGTGAG AAAGATCCAG GGCGATGGCG CAGCGCTCCA 

201 GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG GAGCTGGTGC 

251 TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 

301 AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT 

351 TTTCCTCTAC CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT 

401 TGGGTCCCAC CTTGGACACA CTGCAGCTGG ACGTCGCCGA CTTTGCCACC 

451 ACCATCTGGC AGCAGATGGA AGAACTGGGA ATGGCCCCTG CCCTGCAGCC 

501 CACCCAGGGT GCCATGCCGG CCTTCGCCTA ATAA (SEQ ID NO: 34) 



pMON25181.seq 
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36 



PCT/US96/1593S 



1 ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC 

51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

5 151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

201 GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC ACACCATTGG 

251 GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 

301 GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC 

351 CACCTACAAG CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC 

10 401 TGGGCATCCC CTGGGCTCCC CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG 

451 CTGGCAGGCT GCTTGAGCCA ACTCCATAGC GGCCTTTTCC TCTACCAGGG 

501 GCTCCTGCAG GCCCTGGAAG GGATATCCTA A (SEQ ID NO: 35) 



15 pMON25182.seq 



25 



1 


ATGGCTATGG 


51 


TGCATCTGCT 


101 


TGCAGAGCTT 


151 


CCCACACCAT 


201 


GTCTTTAGAG 


251 


AGAAGCTGTG 


301 


CTCGGACACT 


351 


CCAGGCCCTG 


401 


TCCTCTACCA 


451 


GGTCCCACCT 


501 


CATCTGGCAG 



CTCCAGCTCT GCAACCAACT 
TTTCAACGTC GTGCAGGTGG 
CCTGGAGGTG TCGTACCGCG 
TGGGCCCTGC CAGCTCCCTG 
CAAGTGAGAA AGATCCAGGG 
TGCCACCTAC AAGCTGTGCC 
CTCTGGGCAT CCCCTGGGCT 
CAGCTGGCAG GCTGCTTGAG 
GGGGCTCCTG CAGGCCCTGG 
TGGACACACT GCAGCTGGAC 
CAGATGGAAG AACTGGGATA 



CAAGGTGCAA TGCCAGCATT 
TGTTCTGGTT GCTAGCCATC 
TTCTACGCCA CCTTGCGCAG 
CCCCAGAGCT TCCTGCTCAA 
CGATGGCGCA GCGCTCCAGG 
ACCCCGAGGA GCTGGTGCTG 
CCCCTGAGCT CCTGCCCCAG 
CCAACTCCAT AGCGGCCTTT 
AAGGGATATC CCCCGAGTTG 
GTCGCCGACT TTGCCACCAC 
A (SEQ ID NO:36) 



30 pMON25183.seq 



35 



40 



1 


ATGGCTACTC 


51 


TGCAGGTGGT 


101 


CGTACCGCGT 


151 


AGCTCCCTGC 


201 


GATCCAGGGC 


251 


AGCTGTGCCA 


301 


CCCTGGGCTC 


351 


CTGCTTGAGC 


401 


AGGCCCTGGA 


451 


CAGCTGGACG 


501 


ACTGGGAATG 



AAGGTGCTAT GCCAGCTTTT 
GTTCTGGTTG CTAGCCATCT 
TCTACGCCAC CTTGCGCAGC 
CCCAGAGCTT CCTGCTCAAG 
GATGGCGCAG CGCTCCAGGA 
CCCCGAGGAG CTGGTGCTGC 
CCCTGAGCTC CTGCCCCAGC 
CAACTCCATA GCGGCCTTTT 
AGGGATATCC CCCGAGTTGG 
TCGCCGACTT TGCCACCACC 
GCCCCTGCCC TGCAGCCCTA 



GCTTCTGCTT TTCAACGTCG 
GCAGAGCTTC CTGGAGGTGT 
CCACACCATT GGGCCCTGCC 
TCTTTAGAGC AAGTGAGAAA 
GAAGCTGTGT GCCACCTACA 
TCGGACACTC TCTGGGCATC 
CAGGCCCTGC AGCTGGCAGG 
CCTCTACCAG GGGCTCCTGC 
GTCCCACCTT GGACACACTG 
ATCTGGCAGC AGATGGAAGA 
A (SEQ ID NO: 37) 



45 



pMON25184.seq 



1 ATGGCTTCTG 

51 TCTGCAGAGC 

101 AGCCCACACC 

50 151 AAGTCTTTAG 

201 GGAGAAGCTG 

251 TGCTCGGACA 



CTTTTCAACG 
TTCCTGGAGG 
ATTGGGCCCT 
AGCAAGTGAG 
TGTGCCACCT 
CTCTCTGGGC 



TCGTGCAGGT 
TGTCGTACCG 
GCCAGCTCCC 
AAAGATCCAG 
ACAAGCTGTG 
ATCCCCTGGG 



GGTGTTCTGG 
CGTTCTACGC 
TGCCCCAGAG 
GGCGATGGCG 
CCACCCCGAG 
CTCCCCTGAG 



TTGCTAGCCA 
CACCTTGCGC 
CTTCCTGCTC 
CAGCGCTCCA 
GAGCTGGTGC 
CTCCTGCCCC 



WO 97/12977 



37 



PCT/US96/15935 



301 AGCC.*OGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT 

351 TTTCC TCTAC CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT 

401 TGGGTCCCAC CTTGGACACA CTGCAGCTGG ACGTCGCCGA CTTTGCCACC 

451 ACCATCTGGC AGCAGATGGA AGAACTGGGA ATGGCCCCTG CCCTGCAGCC 

5 501 CACCCAGGGT GCCATGCCGG CCTTCGCCTA A (SEQ ID NO: 38) 

pMON25185.seq 

10 1 ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC 

51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

201 GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC TCTGGCGGCT 

15 251 CTGGCGGCTC TCAGAGCTTC CTGCTCAAGT CTTTAGAGCA AGTGAGAAAG 

301 ATCCAGGGCG ATGGCGCAGC GCTCCAGGAG AAGCTGTGTG CCACCTACAA 

351 GCTGTGCCAC CCCGAGGAGC TGGTGCTGCT CGGACACTCT CTGGGCATCC 

401 CCTGGGCTCC CCTGAGCTCC TGCCCCAGCC AGGCCCTGCA GCTGGCAGGC 

451 TGCTTGAGCC AACTCCATAG CGGCCTTTTC CTCTACCAGG GGCTCCTGCA 

20 501 GGCCCTGGAA GGGATATCCT AA (SEQ ID NO: "9) 

pMON25186.seq 

25 1 ATGGCTATGG CTCCAGCTCT GCAACCAACT CAAGGTGCAA TGCCAGCATT 

51 TGCATCTGCT TTTCAACGTC GTGCAGGTGG TGTTCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

151 CCCTCTGGCG GCTCTGGCGG CTCTCAGAGC TTCCTGCTCA AGTCTTTAGA 

201 GCAAGTGAGA AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT 

30 251 GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGCT GCTCGGACAC 

301 TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT 

351 GCAGCTGGCA GGCTGCTTGA GCCAACTCCA TAGCGGCCTT TTCCTCTACC 

401 AGGGGCTCCT GCAGGCCCTG GAAGGGATAT CCCCCGAGTT GGGTCCCACC 

451 TTGGACACAC TGCAGCTGGA CGTCGCCGAC TTTGCCACCA CCATCTGGCA 

35 501 GCAGATGGAA GAACTGGGAT AA (SEQ ID NO: 40) 

pMON25187.seq 

40 1 ATGGCTACTC AAGGTGCTAT GCCAGCTTTT GCTTCTGCTT TTCAACGTCG 

51 TGCAGGTGGT GTTCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT 

101 CGTACCGCGT TCTACGCCAC CTTGCGCAGC CCTCTGGCGG CTCTGGCGGC 

151 TCTCAGAGCT TCCTGCTCAA GTCTTTAGAG CAAGTGAGAA AGATCCAGGG 

201 CGATGGCGCA GCGCTCCAGG AGAAGCTGTG TGCCACCTAC AAGCTGTGCC 

45 251 ACCCCGAGGA GCTGGTGCTG CTCGGACACT CTCTGGGCAT CCCCTGGGCT 

301 CCCCTGAGCT CCTGCCCCAG CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG 

351 CCAACTCCAT AGCGGCCTTT TCCTCTACCA GGGGCTCCTG CAGGCCCTGG 

401 AAGGGATATC CCCCGAGTTG GGTCCCACCT TGGACACACT GCAGCTGGAC 

451 GTCGCCGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGAAT 

50 501 GGCCCCTGCC CTGCAGCCCT AA (SEQ ID NO: 41) 



v 1 
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pMON25188.seq 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 ATGGCTTCTG 

51 TCTGCAGAGC 

101 AGCCCTCTGG 

151 GAGCAAGTGA 

201 GTGTGCCACC 

251 ACTCTCTGGG 

301 CTGCAGCTGG 

351 CCAGG-iGCTC 

401 CCTTGGACAC 

451 CAGCAGATGG 

501 TGCCATGCCG 



pMON3460.seq 



CTTTTCAACG 
TTCCTGGAGG 
CGGCTCTGGC 
GAAAGATCCA 
TACAAGCTGT 
CATCCCCTGG 
CAGGCTGCTT 
CTGCAGGCCC 
ACTGCAGCTG 
AAGAACTGGG 
GCCTTCGCCT 



TCGTGCAGGT 
TGTCGTACCG 
GGCTCTCAGA 
GGGCGATGGC 
GCCACCCCGA 
GCTCCCCTGA 
GAGCCAACTC 
TGGAAGGGAT 
GACGTCGCCG 
AATGGCCCCT 
AA (SEQ ID 



GGTGTTCTGG 
CGTTCTACGC 
GCTTCCTGCT 
GCAGCGCTCC 
GGAGCTGGTG 
GCTCCTGCCC 
CATAGCGGCC 
ATCCCCCGAG 
ACTTTGCCAC 
GCCCTGCAGC 
NO : 42 ) 



TTGCTAGCCA 
CACCTTGCGC 
CAAGTCTTTA 
AGGAGAAGCT 
CTGCTCGGAC 
CAGC CAGGCC 
TTTTCCTCTA 
TTGGGTCCCA 
CACCATCTGG 
CCACCCAGGG 



1 ATGGCTCTGC TCGGACACTC TCTGGGCATC CCCTGGGCTC CCCTGAGCTC 

51 CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC CAACTCCATA 

101 GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 

151 CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT 

201 TGCCACCACC ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC 

251 TGCAGCCCAC CCAGGGTGCC ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC 

301 CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT 

351 GTCGTACCGC GTTCTACGCC ACCTTGCGCA GCCCACACCA TTGGGCCCTG 

401 CCAGCTCCCT GCCCCAGAGC TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA 

451 AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT GTGCCACCTA 

501 CAAGCTGTGC CACCCCGAGG AGCTGGTGTA ATAA (SEQ ID NO: 86) 



pMON3461.seq 

1 ATGGCTCAAC 

51 CCTGGAAGGG 

101 TGGACGTCGC 

151 GGAATGGCCC 

201 CTCTGCTTTC 

251 AGAGCTTCCT 

301 ACACCATTGG 

351 TTTAGAGCAA 

401 AGCTGTGTGC 

451 GGACACTCTC 

501 GGCCCTGCAG 



TCCATAGCGG 
ATATCCCCCG 
CGACTTTGCC 
CTGCCCTGCA 
CAGCGCCGGG 
GGAGGTGTCG 
GCCCTGCCAG 
GTGAGAAAGA 
CACCTACAAG 
TGGGCATCCC 
CTGGCAGGCT 



PMON3462 .seq 

1 ATGGCTCTTT TCCTCTACCA 

51 CCCCGAGTTG GGTCCCACCT 

101 TTGCCACCAC CATCTGGCAG 

151 CTGCAGCCCA CCCAGGGTGC 

201 CCGGGCAGGA GGGGTCCTGG 



CCTTTTCCTC 
AGTTGGGTCC 
ACCACCATCT 
GCCCACCCAG 
CAGGAGGGGT 
TACCGCGTTC 
CTCCCTGCCC 
TCCAGGGCGA 
CTGTGCCACC 
CTGGGCTCCC 
GCTTGAGCTA 



GGGGCTCCTG 
TGGACACACT 
CAGATGGAAG 
CATGCCGGCC 
TTGCTAGCCA 



TACCAGGGGC 
CACCTTGGAC 
GGCAGCAGAT 
GGTGCCATGC 
CCTGGTTGCT 
TACGCCACCT 
CAGAGCTTCC 
TGGCGCAGCG 
CCGAGGAGCT 
CTGAGCTCCT 
ATAA (SEQ 



TCCTGCAGGC 
ACACTGCAGC 
GGAAGAACTG 
CGGCCTTCGC 
AGCCATCTGC 
TGCGCAGCCC 
TGCTCAAGTC 
CTCCAGGAGA 
GGTGCTGCTC 
GCCCCAGCCA 
ID NO: 87) 



CAGGCCCTGG 
GCAGCTGGAC 
AACTGGGAAT 
TTOJCCTCTG 
TCTGCAGAGC 



AAGGGATATC 
GTCGCCGACT 
GGCCCCTGCC 
CTTTCCAGCG 
TTCCTGGAGG 
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251 TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCACACC ATTGGGCCCT 

301 GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC AAGTCTTTAG AGCAAGTGAG 

351 AAAGATCCAG GGCGATGGCG CAGCGCTCCA GGAGAAGCTG TGTGCCACCT 

401 ACAAGCTGTG CCACCCCGAG GAGCTGGTGC TGCTCGGACA CTCTCTGGGC 

451 ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC AGCCAGGCCC TGCAGCTGGC 

501 AGGCTGCTTG AGCCAACTCC ATAGCGGCTA ATAA (SEQ ID NO: 88) 



pMON3463 .seq 

1 ATGGCTCTCT ACCAGGGGCT CCTGCAGGCC CTGGAAGGGA TATCCCCCGA 

51 GTTGGGTCCC ACCTTGGACA CACTGCAGCT GGACGTCGCC GACTTTGCCA 

101 CCACCATCTG GCAGCAGATG GAAGAACTGG GAATGGCCCC TGCCCTGCAG 

151 CCCACCCAGG GTGCCATGCC GGCCTTCGCC TCTGCTTTCC AGCGCCGGGC 

201 AGGAGGGGTC CTGGTTGCTA GCCATCTGCA GAGCTTCCTG GAGGTGTCGT 

251 ACCGCGTTCT ACGCCACCTT GCGCAGCCCA CACCATTGGG CCCTGCCAGC 

301 TCCCTGCCCC AGAGCTTCCT GCTCAAGTCT TTAGAGCAAG TGAGAAAGAT 

351 CCAGGGCGAT GGCGCAGCGC TCCAGGAGAA GCTGTGTGCC ACCTACAAGC 

401 TGTGC'.ACCC CGAGGAGCTG GTGCTGCTCG GACACTCTCT GGGCATCCCC 

451 TGGGCTCCCC TGAGCTCCTG CCCCAGCCAG GCCCTGCAGC TGGCAGGCTG 

501 CTTGAGCCAA CTCCATAGCG GCCTTTTCTA ATAA (SEQ ID NO: 89) 



pMON3464.seq 

1 ATGGCTGCCC TGGAAGGGAT ATCCCCCGAG TTGGGTCCCA CCTTGGACAC 

51 ACTGCAGCTG GACGTCGCCG ACTTTGCCAC CACCATCTGG CAGCAGATGG 

101 AAGAACTGGG AATGGCCCCT GCCCTGCAGC CCACCCAGGG TGCCATGCCG 

151 GCCTTCGCCT CTGCTTTCCA GCGCCGGGCA GGAGGGGTCC TGGTTGCTAG 

201 CCATCTGCAG AGCTTCCTGG AGGTGTCGTA CCGCGTTCTA CGCCACCTTG 

251 CGCAGCCCAC ACCATTGGGC CCTGCCAGCT CCCTGCCCCA GAGCTTCCTG 

301 CTCAAGTCTT TAGAGCAAGT GAGAAAGATC CAGGGCGATG GCGCAGCGCT 

351 CCAGGAGAAG CTGTGTGCCA CCTACAAGCT GTGCCACCCC GAGGAGCTGG 

401 TGCTGCTCGG ACACTCTCTG GGCATCCCCT GGGCTCCCCT GAGCTCCTGC 

451 CCCAGCCAGG CCCTGCAGCT GGCAGGCTGC TTGAGCCAAC TCCATAGCGG 

501 CCTTTTCCTC TACCAGGGGC TCCTGCAGTA ATAA (SEQ ID NO: 90) 



pMON3465.seq 

1 ATGGCTGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGAAT 

51 GGCCC 'TGCC CTGCAGCCCA CCCAGGGTGC CATGCCGGCC TTCGCCTCTG 

101 CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA TCTGCAGAGC 

151 TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCACACC 

201 ATTGGGCCCT GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC AAGTCTTTAG 

251 AGCAAGTGAG AAAGATCCAG GGCGATGGCG CAGCGCTCCA GGAGAAGCTG 

301 TGTGCCACCT ACAAGCTGTG CCACCCCGAG GAGCTGGTGC TGCTCGGACA 

351 CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC AGCCAGGCCC 

401 TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT TTTCCTCTAC 

451 CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT TGGGTCCCAC 

501 CTTGGACACA CTGCAGCTGG ACGTCGCCTA ATAA (SEQ ID NO: 91) 
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pMON 3466. seq 

1 ATGGCTATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA 

5 51 GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG 

101 CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT GGAGGTGTCG 

151 TACCCCGTTC TACGCCACCT TGCGCAGCCC ACACCATTGG GCCCTGCCAG 

201 CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA GTGAGAAAGA 

251 TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC CACCTACAAG 

10 301 CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC TGGGCATCCC 

351 CTGGGCTCCC CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG CTGGCAGGCT 

401 GCTTGAGCCA ACTCCATAGC GGCCTTTTCC TCTACCAGGG GCTCCTGCAG 

451 GCCCTGGAAG GGATATCCCC CGAGTTGGGT CCCACCTTGG ACACACTGCA 

501 GCTGGACGTC GCCGACTTTG CCACCACCTA ATAA (SEQ ID NO: 92) 



15 



30 



pMON3467.seq 



1 ATGGCTCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCAC 

20 51 CCAGGGTGCC ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC CGGGCAGGAG 

101 GGGTCCTGGT TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT GTCGTACCGC 

151 GTTCTACGCC ACCTTGCGCA GCCCACACCA TTGGGCCCTG CCAGCTCCCT 

201 GCCCCAGAGC TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA AAGATCCAGG 

251 GCGATGGCGC AGCGCTCCAG GAGAAGCTGT GTGCCACCTA CAAGCTGTGC 

25 301 CACCCCGAGG AGCTGGTGCT GCTCGGACAC TCTCTGGGCA TCCCCTGGGC 

351 TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT GCAGCTGGCA GGCTGCTTGA 

401 GCCAACTCCA TAGCGGCCTT TTCCTCTACC AGGGGCTCCT GCAGGCCCTG 

451 GAAGGGATAT CCCCCGAGTT GGGTCCCACC TTGGACACAC TGCAGCTGGA 

501 CGTCGnCGAC TTTGCCACCA CCATCTGGTA ATAA (SEQ ID NO: 93) 



pMON3499.seq 



1 ATGGCTTTGT TAGGACATTC TTTAGGTATT CCATGGGCTC CTCTGAGCTC 

35 51 CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC CAACTCCATA 

101 GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 

151 CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT 

201 TGCCACCACC ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC 

251 TGCAGCCCAC CCAGGGTGCC ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC 

40 301 CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT 

351 GTCGTACCGC GTTCTACGCC ACCTTGCGCA GCCCACACCA TTGGGCCCTG 

401 CCAGCTCCCT GCCCCAGAGC TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA 

451 AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT GTGCCACCTA 

501 CAAGCTGTGC CACCCCGAGG AGCTGGTGTA ATAA (SEQ ID NO: 94) 

45 
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TABLE 1 
PROTEIN SEQUENCES 

pMON3485.Pep 

5 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu.Gly His Ser 

Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 

Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 

Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu 

10 Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 

Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 

Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 

15 Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser 

Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
Glu Lys Leu Cys Ala Thr (SEQ ID NO: 43) 



20 pMON3486.Pep 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 

25 Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
Ala Gin Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu 
Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu 

30 Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro 
Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu 
Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
Ala Leu Glu Gly He Ser (SEQ ID NO: 44) 

35 

PMON3487 .Pep 

Met Ala Pr.> Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe 
Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser 

40 His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His 
Leu Ala Gin Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu 
Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala 
Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro 
Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala 

45 Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys 
Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu 
Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp 
Gin Met Glu Glu Leu Gly (SEQ ID NO: 45) 



50 

pMON3488.Pep 
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10 



15 



20 



25 



30 



Ala Pro Ala Leu Gin Pro (SEQ ID NO: 46) 
pMON3489.Pep 



Ala 


Phe 


Gin 


Arg 


Arg 


Gin 


Ser 


Phe 


Leu 


Glu 


Gin 


Pro 


Ser 


Gly Gly 


Ser 


Leu 


Glu 


Gin 


Val 


Gin 


Glu 


Lys 


Leu 


Cys 


Leu 


Val 


Leu 


Leu 


Gly 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Glu 


Gly 


He 


Ser 


Gin 


Leu 


Asp 


Val 


Ala 


Glu 


Glu 


Leu 


Gly Met 


Leu 


Val 


Ala 


Ser 


His 


Val 


Leu 


Arg 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Leu 


Gly Asp 


Gly 


Ala 


Ala 


Leu Cys 


His 


Pro 


Glu 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ala 


Gly 


Cys 


Leu 


Gin Gly 


Leu 


Leu 


Gin 


Pro 


Thr 


Leu 


Asp 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Gin 


Pro 


Thr 


Gin Gly 



Ala Met Pro Ala Phe Ala (SEQ ID NO: 47) 
pMON3490.Pep 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

35 Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 

Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 

Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu 

Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

40 Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 

Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 

Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 

Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu 

Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala 
45 Ala Leu Gin Glu Lys Leu Cys Ala Thr (SEQ ID NO: 48) 

pMON3491.Pep 

50 Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
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Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 

Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 

Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser 

Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys lie Gin Gly Asp 

5 Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 

His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly lie Pro 

Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala 

Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly 

Leu Leu Gin Ala Leu Glu Gly He Ser (SEQ ID NO:49) 

10 



pMON3492.Pep 





Met 


Ala 


Pro 


Ala 


Leu 


Gin Pro Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


15 


Ala 


Ser 


Ala 


Phe 


Gin Arg Arg Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 




His 


Leu 


Gin 


Ser 


Phe 


Leu Glu Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 




Leu 


Ala 


Gin 


Pro 


Thr 


Pro Leu Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 




Ser 


Phe 


Leu 


Leu 


Lys 


Ser Leu Glu 


Gin 


Val 


Arg 


Lys 


He Gin Gly 




Asp 


Gly 


Ala 


Ala 


Leu 


Gin Glu Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


20 


Cys 


His 


Pro 


Glu 


Glu 


Leu Val Leu 


Leu 


Gly 


His 


Ser 


Leu Gly 


He 




Pro 


Trp 


Ala 


Pro 


Leu 


Ser Ser Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 




Ala 


Gly 


Cys 


Leu 


Ser 


Gin Leu His 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 




Gly 


Leu 


Leu 


Gin 


Ala 


Leu Glu Gly 


He 


Ser 


Pro 


Glu 


Leu 


Gly 


Pro 




Thr 


Leu 


Asp 


Thr 


Leu 


Gin Leu Asp 


Val 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


25 


He 


Trp 


Gin 


Gin 


Met 


Glu Glu Leu Gly 


(SEQ ID NO:50) 







pMON3493.Pep 

3 0 Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 

Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 

Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu 

Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 

Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 

35 Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 

Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser 

Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 

His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu 
Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu 

40 Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu 
Leu Gly Met Ala Pro Ala Leu Gin Pro (SEQ ID NO: 51) 



pMON3494.Pep 

45 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 

Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 

Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser 

Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp 

50 Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 

His Pro Glu Glu Leu Val L u Leu Gly His Ser -Leu Gly He Pro 

Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala 



WO 97/12977 





Gly 


Cys 


Leu 


Ser 




Leu 


Leu 


Gin 


Ala 




Leu 


Asp 


Thr 


Leu 




Trp 


Gin 


Gin 


Met 


5 


Thr 


Gin 


Gly 


Ala 




pMON25181.pep 


10 


Pro 


Glu 


Leu 


Gly 




Asp 


Phe 


Ala 


Thr 




Ala 


Pro 


Ala 


Leu 




Ser 


Ala 


Phe 


Gin 




Leu 


Gin 


Ser 


Phe 


15 


Ala 


Gin 


Pro 


Thr 




Phe 


Leu 


Leu 


Lys 




Gly 


Ala 


Ala 


Leu 




His 


Pro 


Glu 


Glu 




Trp 


Ala 


Pro 


Leu 


20 


Gly 


Cys 


Leu 


Ser 




Leu 


Leu 


Gin 


Ala 




pMON2 5182.pep 


2d 












Met 


Ala 


Pro 


Ala 




Ala 


Ser 


Ala 


Phe 




His 


Leu 


Gin 


Ser 




Leu 


Ala 


Gin 


Pro 


30 


Ser 


Phe 


Leu 


Leu 




Asp 


Gly 


Ala 


Ala 




Cys 


His 


Pro 


Glu 




Pro 


Trp 


Ala 


Pro 




Ala 


Gly 


Cys 


Leu 


35 


Gly 


Leu 


Leu 


Gin 




Thr 


Leu 


Asp 


Thr 




He 


Trp 


Gin 


Gin 


40 


pMON25183.pep 




Thr 


Gin 


Gly Ala 




Ala 


Gly 


Gly Val 




Val 


Ser 


Ty_- 


Arg 


45 


Gly 


Pro 


Ala 


Ser 




Glu 


Gin 


Val 


Arg 




Lys 


Leu 


Cys 


Ala 




Leu 


Leu Gly 


His 




Cys 


Pro 


Ser 


Gin 


50 


His 


Ser Gly 


Leu 




Gly 


He 


Ser 


Pro 



44 



Gin Leu His Ser Gly 
Leu Glu Gly lie Ser 
Gin Leu Asp Val Ala 
Glu Glu Leu Gly Met 
Met Pro Ala Phe Ala 



Pro Thr Leu Asp Thr 
Thr He Trp Gin Gin 
Gin Pro Thr Gin Gly 
Arg Arg Ala Gly Gly 
Leu Glu Val Ser Tyr 
Pro Leu Gly Pro Ala 
Ser Leu Glu Gin Val 
Gin Glu Lys Leu cys 
Leu Val Leu Leu Gly 
Ser Ser Cys Pro Ser 
Gin Leu His Ser Gly 
Leu Glu Gly He Ser 



Leu Gin Pro Thr Gin 
Gin Arg Arg Ala Gly 
Phe Leu Glu Val Ser 
Thr Pro Leu Gly Pro 
Lys Ser Leu Glu Gin 
Leu Gin Glu Lys Leu 
Glu Leu Val Leu Leu 
Leu Ser Ser Cys Pro 
Ser Gin Leu His Ser 
Ala Leu Glu Gly He 
Leu Gin Leu Asp Val 
Met Glu Glu Leu Gly 



Met Pro Ala Phe Ala 
Leu Val Ala Ser His 
Val Leu Arg His Leu 
Ser Leu Pro Gin Ser 
Lys He Gin Gly Asp 
Thr Tyr Lys Leu Cys 
Ser Leu Gly He Pro 
Ala Leu Gin Leu Ala 
Phe Leu Tyr Gin Gly 
Glu Leu Gly Pro Thr 



PCT/US96/15935 



Leu Phe Leu Tyr Gin Gly 
Pro Glu Leu Gly Pro Thr 
Asp Phe Ala Thr Thr He 
Ala Pro Ala Leu Gin Pro 
(SEQ ID NO: 52) 



Leu Gin Leu Asp Val Ala 
Met Glu Glu Leu Gly Met 
Ala Met Pro Ala Phe Ala 
Val Leu Val Ala Ser His 
Arg Val Leu Arg His Leu 
Ser Ser Leu Pro Gin Ser 
Arg Lys He Gin Gly Asp 
Ala Thr Tyr Lys Leu Cys 
His Ser Leu Gly He Pro 
Gin Ala Leu Gin Leu Ala 
Leu Phe Leu Tyr Gin Gly 
(SEQ ID NO: 53) 



Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Val 


Arg 


Lys 


He 


Gin Gly 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


He 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Ser 


Pro 


Glu 


Leu 


Gly 


Pro 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


(SEQ ID 


NO:54) 






Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Gly 


Ala 


Ala 


Leu 


Gin 


Glu 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Gly Cys 


Leu 


Ser 


Gin 


Leu 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 
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Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu 
Leu Gly Met Ala Pro Ala Leu Gin Pro (SEQ ID NO: 55) 



5 pMON2 5184, pep 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Sei Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser 

10 Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp 
Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 
His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro 
Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala 
Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly 

15 Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr 
Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro 
Thr Gin Gly Ala Met Pro Ala Phe Ala (SEQ ID NO: 56) 

20 

pMON25185.pep 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 

25 Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
Ala Gin Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu 
Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala 

30 Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu 
Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro 
Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu 
Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
Ala Leu Glu Gly He Ser (SEQ ID NO: 57) 

35 



pMON25186.pep 





Met 


Ala 


Pro 


Ala 


Leu Gin Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


40 


Ala 


Ser 


Ala 


Phe 


Gin Arg Arg Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 




His 


Leu 


Gin 


Ser 


Phe Leu Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 




Leu 


Ala 


Gin 


Pro 


Ser Gly Gly 


Ser 


Gly 


Gly 


Ser 


Gin 


Ser 


Phe 


Leu 




Leu 


Lys 


Ser 


Leu 


Glu Gin Val 


Arg 


Lys 


He 


Gin 


Gly 


Asp 


Gly Ala 




Ala 


Leu 


Gin 


Glu 


Lys Ala Thr Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


45 


Leu 


Val 


Leu 


Leu 


Gly His Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 




Ser 


Ser 


Cys 


Pro 


Ser Gin Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 




Gin 


Leu 


His 


Ser 


Gly Leu Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 




Leu 


Glu 


Gly 


He 


Ser Pro Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 




Gin 


Leu 


Asp 


Val 


Ala Asp Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


50 


Glu 


Glu 


Leu 


Gly 


(SEQ ID NO:! 


58) 
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PMON25187 .pep 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 
Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 
5 Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Ser Gly Gly 

Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys 
Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly 
His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
10 Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
Ala Pro Ala Leu Gin Pro (SEQ ID NO: 59) 

15 





pMON25188.pep 
























Ser 


Ala 


Phe 


Gin 


Arg 


Arg Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


20 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 




Ala 


Gin 


Pro 


Ser 


Gly 


Gly Ser 


Gly 


Gly 


Ser 


Gin 


Ser 


Phe 


Leu 


Leu 




Lys 


Ser 


Leu 


Glu 


Gin 


Val Arg 


Lys 


lie 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 




Leu 


Gin 


Glu 


Lys 


Leu 


Cys Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 




Glu 


Leu 


Val 


Leu 


Leu Gly His 


Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


25 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 




Ser 


Gin 


Leu 


His 


Ser Gly Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 




Ala 


Leu 


Glu 


Gly 


He 


Ser Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 




Leu 


Gin 


Leu 


Asp 


Val 


Ala Asp 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 




Met 


Glu 


Glu 


Leu 


Gly Met Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


30 


Ala 


Met 


Pro 


Ala 


Phe 


Ala (SEQ ID 


NO: 60) 












pMON3460.Pep 
























Leu 


Leu 


Gly 


His 


Ser 


Leu Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


35 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 




His 


Ser 


Gly 


Leu 


Phe 


Leu Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 




Gly 


He 


Ser 


Pro 


Glu 


Leu Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu. 




Asp 


Val 


Ala 


Asp 


Phe 


Ala Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 




Leu 


Gly 


Met 


Ala 


Pro 


Ala Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


40 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe Gin 


Arg 


Arg 


Ala 


Gly 


Gly 


Val 


Leu 


Val 




Ala 


Ser 


His 


Leu 


Gin 


Ser Phe 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 




Arg 


His 


Leu 


Ala 


Gin 


Pro Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 




Pro 


Gin 


Ser 


Phe 


Leu 


Leu Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 




Gin 


Gly 


Asp 


Gly 


Ala 


Ala Leu 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


45 


Lys 


Leu 


Cys 


His 


Pro 


Glu Glu 


Leu 


Val 


(SEQ ID 


NO:95) 







pMON3461.Pep 

50 Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala 

Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu 

Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met 



(1 
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Glu Glu Leu Gly Met Ala Pro Ala 
' Met Pro Ala Phe Ala Ser Ala Phe 
Leu Val Ala Ser His Leu Gin Ser 
Val Leu Arg His Leu Ala Gin Pro 
5 Ser Leu Pro Gin Ser Phe Leu Leu 
Lys lie Gin Gly Asp Gly Ala Ala 
Thr Tyr Lys Leu Cys His Pro Glu 
Ser Leu Gly lie Pro Trp Ala Pro 
Ala Leu Gin Leu Ala Gly Cys Leu 



Leu Gin Pro Thr Gin Gly Ala 
Gin Arg Arg Ala Gly Gly Val 
Phe Leu Glu Val Ser Tyr Arg 
Thr Pro Leu Gly Pro Ala Ser 
Lys Ser Leu Glu Gin Val Arg 
Leu Gin Glu Lys Leu Cys Ala 
Glu Leu Val Leu Leu Gly His 
Leu Ser Ser Cys Pro Ser Gin 
Ser (SEQ ID NO:96) 



3462. Pep 





Leu 


Phe 


Leu 


Tyr 


15 


Pro 


Glu 


Leu 


Gly 




Asp 


Phe 


Ala 


Thr 




Ala 


Pro 


Ala 


Leu 




Ser 


Ala 


Phe 


Gin 




Leu 


Gin 


Ser 


Phe 


20 


Ala 


Gin 


Pro 


Thr 




Phe 


Leu 


Leu 


Lys 




Gly 


Ala 


Ala 


Leu 




His 


Pro 


Glu 


Glu 




Trp 


Ala 


Pro 


Leu 


25 


Gly 


Cys 


Leu 


Ser 



Gin Gly Leu Leu Gin Ala 
Pro Thr Leu Asp Thr Leu 
Thr lie Trp Gin Gin Met 
Gin Pro Thr Gin Gly Ala 
Arg Arg Ala Gly Gly Val 
Leu Glu Val Ser Tyr Arg 
Pro Leu Gly Pro Ala Ser 
Ser Leu Glu Gin Val Arg 
Gin Glu Lys Leu Cys Ala 
Leu Val Leu Leu Gly His 
Ser Ser Cys Pro Ser Gin 
Gin Leu His Ser Gly (SB 



Leu Glu Gly lie Ser 

Gin Leu Asp Val Ala 

Glu Glu Leu Gly Met 

Met Pro Ala Phe Ala 

Leu Val Ala Ser His 

Val Leu Arg His Leu 

Ser Leu Pro Gin Ser 

Lys lie Gin Gly Asp 

Tr.r Tyr Lys Leu Cys 

Ser Leu Gly lie Pro 

Ala Leu Gin Leu Ala 

! ID NO:97) 





34 63. Pep 




30 


Leu 


Tyr 


Gin 


Gly 




Leu 


Gly 


Pro 


Thr 




Ala 


Thr 


Thr 


He 




Ala 


Leu 


Gin 


Pro 




Phe 


Gin 


Arg 


Arg 


35 


Ser 


Phe 


Leu 


Glu 




Pro 


Thr 


Pro 


Leu 




Leu 


Lys 


Ser 


Leu 




Ala 


Leu 


Gin 


Glu 




Glu 


Glu 


Leu 


Val 


40 


Pro 


Leu 


Ser 


Ser 




Leu 


Ser 


Gin 


Leu 



Leu Leu Gin Ala Leu 
Leu Asp Thr Leu Gin 
Trp Gin Gin Met Glu 
Thr Gin Gly Ala Met 
Ala Gly Gly Val Leu 
Val Ser Tyr Arg Val 
Gly Pro Ala Ser Ser 
Glu Gin Val Arg Lys 
Lys Leu Cys Ala Thr 
Leu Leu Gly His Ser 
Cys Pro Ser Gin Ala 
His Ser Gly Leu Phe 



Glu Gly lie- Ser Pro Glu 
Leu Asp Val Ala Asp Phe 
Glu Leu Gly Met Ala Pro 
Pro Ala Phe Ala Ser Ala 
Val Ala Ser His Leu Gin 
Leu Arg His Leu Ala Gin 
Leu Pro Gin Ser Phe Leu 
He Gin Gly Asp Gly Ala 
Tyr Lys Leu Cys His Pro 
Leu Gly He Pro Trp Ala 
Leu Gin Leu Ala Gly Cys 
(SEQ ID NO: 98) 



3464. Pep 

45 

Ala Leu Glu Gly He 
Leu Gin Leu Asp Val 
Met Glu Gli Leu Gly 
Ala Met Pro Ala Phe 
50 Val Leu Val Ala S r 
Arg Val L u Arg His 
Ser Ser Leu Pro Gin 



Ser Pro Glu Leu Gly 
Ala Asp Phe Ala Thr 
Met Ala Pro Ala Leu 
Ala Ser Ala Phe Gin 
His Leu Gin Ser Phe 
Leu Ala Gin Pro Thr 
Ser Phe Leu Leu Lys 



Pro Thr Leu Asp Thr 
Thr He Trp Gin Gin 
Gin Pro Thr Gin Gly 
Arg Arg Ala Gly Gly 
Leu Glu Val Ser Tyr 
Pro Leu Gly Pro Ala 
Ser Leu Glu Gin Val 
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Arg Lys lie Gin Gly Asp Gly Ala 
Ala Thr Tyr Lys Leu Cys His Pro 
His Ser Leu Gly He Pro Trp Ala 
Gin Ala Leu Gin Leu Ala Gly Cys 
5 Leu Phe Leu Tyr Gin Gly Leu Leu 



Ala Leu Gin Glu Lys Leu Cys 

Glu Glu Leu Val Leu Leu Gly 

Pro Leu Ser Ser Cys Pro Ser 

Leu Ser Gin Leu His Ser Gly 
Gin (SEQ ID NO: 99) 





3465. Pep 








10 


Asp 


Phe 


Ala 


Thr 


Thr 


He Trp 




Ala 


Pro 


Ala 


Leu 


Gin 


Pro Thr 




Ser 


Ala 


Phe 


Gin 


Arg 


Arg Ala 




Leu 


Gin 


Ser 


Phe 


Leu 


Glu Val 




Ala 


Gin 


Pro 


Thr 


Pro 


Leu Gly 


15 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu Glu 




Gly 


Ala 


Ala 


Leu 


Gin 


Glu Lys 




His 


Pro 


Glu 


Glu 


Leu 


Val Leu 




Trp 


Ala 


Pro 


Leu 


Ser 


Ser Cys 




Gly 


Cys 


Leu 


Ser 


Gin 


Leu His 


20 


Leu 


Leu 


Gin 


Ala 


Leu Glu Gly 




Leu 


Asp 


Thr 


Leu 


Gin 


Leu Asp 



Gin Gin Met Glu Glu Leu Gly Met 
Gin Gly Ala Met Pro Ala Phe Ala 
Gly Gly Val Leu Val Ala Ser His 
Ser Tyr Arg Val Leu Arg His Leu 
Pro Ala Ser Ser Leu Pro Gin Ser 
Gin Val Arg Lys He Gin Gly Asp 
Leu Cys Ala Thr Tyr Lys Leu Cys 
Leu Gly His Ser Leu Gly He Pro 
Pro Ser Gin Ala Leu Gin Leu Ala 
Ser Gly Leu Phe Leu Tyr Gin Gly 
He Ser Pro Glu Leu Gly Pro Thr 
Val Ala (SEQ ID NO: 100) 



25 



3466. Pep 




















lie 


Trp 


Gin 


Gin 


Met 


Glu Glu Leu Gly Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met Pro Ala 


Phe Ala 


Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Ala 


Gly 


Gly Val 


Leu Val Ala 


Ser His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val Leu Arg 


His Leu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser Leu Pro 


Gin Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys He Gin 


Gly Asp 


Gly 


Ala 


Ala 


Leu 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr Tyr Lys 


Leu Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Leu 


Leu 


Gly 


His 


Ser Leu Gly 


He Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala Leu Gin 


Leu Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe Leu Tyr Gin Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


Pro Glu Leu Gly 


Pro Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe Ala Thr 


Thr (SEQ ID 


NO: : 


101) 







40 3467. Pep 

Gin Gin Met Glu Glu Leu Gly 
Gin Gly Ala Met Pro Ala Phe 
Gly Gly Val Leu Val Ala Ser 

45 Ser Tyr Arg Val Leu Arg His 
Pro Ala Ser Ser Leu Pro Gin 
Gin Val Arg Lys He Gin Gly 
Leu Cys Ala Thr Tyr Lys Leu 
Leu Gly His Ser Leu Gly He 

50 Pro Ser Gin Ala Leu Gin Leu 
Ser Gly Leu Phe Leu Tyr Gin 
lie Ser Pro Glu Leu Gly Pro 



Met Ala Pro Ala Leu Gin Pro Thr 

Ala Ser Ala Phe Gin Arg Arg Ala 

His Leu Gin Ser Phe Leu Glu Val 

Leu Ala Gin Pro Thr Pro Leu Gly 

Ser Phe Leu Leu Lys Ser Leu Glu 

Asp Gly Ala Ala Leu Gin Glu Lys 

Cys His Pro Glu Glu Leu Val Leu 

Pro Trp Ala Pro Leu Ser Ser Cys 

Ala Gly Cys Leu Ser Gin Leu His 

Gly Leu Leu Gin Ala Leu Glu Gly 

Thr Leu Asp Thr Leu Gin Leu Asp 
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Val Ala As 9 Phe Ala Thr Thr lie Trp (SEQ ID NO: 102) 
3499 .Pep 

5 



Leu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


Pro Glu Leu Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Ala 


Gly Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


Gin 


Gly 


Asp Gly Ala Ala Leu 


Gin 


Glu 


Lys 


Leu Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


(SEQ ID 


NO: 103) 
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Materials and M P rh^p 

Recombinant DNA mot-hods 
5 Unless noted otherwise, all specialty chemicals were 

obtained from Sigma Co., (St. Louis, MO). Restriction 
endonuc leases and T4 DNA ligase were obtained from New 
England Biolabs (Beverly, MA) or Boehringer Mannheim 
(Indianapolis, IN) . 

10 

Transformation of E. nnli strains 

E. coli strains, such as DHSof* 4 (Life Technologies, 
Gaithersburg, MD) and TGI (Amersham Corp . , Arlington 
15 Heights, ID are used for transformation of ligation 

reactions and are the source of plasmid DNA for trans feet ing 
mammalian cells. E. coli strains, such as MON105 and JM101, 
can be used for expressing the G-CSF receptor agonist of the 
present invention in the cytoplasm or periplasmic space. 

20 

MON105 ATCC#55204: F-, lamda- , IN( rrnD, rrE)l, rpoD+, rpoH358 

DH5a™ : F-, phi80dlacZdeltaM15, delta ( lacZYA-argF) U169 , 
deoR, recAl, endAl, hsdR17 (rk-,mk+) , phoA, supE441amda- , 
25 thi-1, gyrA96, relAl 

TGI: delta ( lac-pro) , supE, thi-1, hsdD5/F' (traD36, proA+B+. 
laclq, lacZdeltaM15) 

30 DH5a™ Subcloning efficiency cells are purchased as 

competent cells and are ready for transformation using the 
manufacturer's protocol, while both E. coli strains TGI and 
MON105 are rendered competent to take up DNA using a CaCl2 
method. Typically, 20 to 50 mL of cells rre grown in LB 

35 medium (1% Bacto-tryptone, 0.5% Bacto-yeast extract. 150 mM 
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NaCl) to a density of approximately 1.0 optical density unit 
at 600 nanometers (OD600) as measured by a Baush & Lomb 
Spectronic spectrophotometer (Rochester, NY) . The cells are 
collected by centrifugation and resuspended in one-fifth 
5 culture volume of CaCl2 solution (50 mM CaCl2, 10 mM Tris- 
Cl, pH7.4) and are held at 4'C for 30 minutes. The cells 
are again collected by centrifugation and resuspended in 
one-tenth culture volume of CaCl2 solution. Ligated DNA is 
added to 0.2mL of these cells, and the samples are held at 

10 4 # C for 1 hour. The samples are shifted «-o 42 'C for two 

minutes and ImL of LB is added prior to shaking the samples 
at 37 *C for one hour. Cells from these samples are spread 
on plates (LB medium plus 1.5% Bacto-agar) containing either 
ampicillin (100 micrograms /mL, ug/mL) when selecting for 

15 ampicillin-resistant transf ormants, or spectinomycin (75 
ug/mL) when selecting for spectinomycin-resistant 
transf ormants . The plates are incubated overnight at 37 *C. 
Single colonies are picked, grown in LB supplemented with 
appropriate antibiotic for 6-16 hours at 37 *C with shaking. 

20 Colonies are picked and inoculated into LB plus 

appropriate antibiotic (100 ug/mL ampicillin or 75 ug/mL 
spectinomycin) and are grown at 37°C while shaking. Before 
harvesting the cultures, 1 ul of cells are analyzed by PCR 
for the presence of a G-CSF gene. The PCR is carried out 

25 using a combination of primers that anneal to the G-CSF gene 
and/or vector. After the PCR is complete, loading dye is 
added to the sample followed by electrophoresis as described 
earlier. \ gene has been ligated to the vector when a PCR 
product of the expected size is observed. 

30 

Methods for creation of genes with new N-terminus/C- terminus 



Method I. Creation of genes with new N-terminus/C-terminus 
which contain a linker region. 
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Genes with new N-terminus/C-terminus which contain a 
linker region separating the original C-terminus and N- 
terminus can be made essentially following the method 
described in L. S. Mullins, et al J. Am. Chem. Soc. 116, 
5 5529-5533 '.1994). Multiple steps of polymerase chain 

reaction (PCR) amplifications are used to rearrange the DNA 
sequence encoding the primary amino acid sequence of the 
protein. The steps are illustrated in Figure 2. 

10 In the first step, the primer set ("new start" and 

"linker start") is used to create and amplify, from the 
original gene sequence, the DNA fragment ("Fragment Start") 
that contains the sequence encoding the new N-terminal 
portion of the new protein followed by the linker that 

15 connects the C-terminal and N-terminal ends of the original 
protein. In the second step, the primer set ("new stop" and 
"linker stop") is used to create and amplify, from the 
original gene sequence, the DNA fragment ("Fragment Stop") 
that encodes the same linker as used above, followed by the 

20 new C-terminal portion of the new protein. The "new start" 
and "new stop" primers are designed to include the 
appropriate restriction enzyme recognition sites which allow 
cloning of the new gene into expression plasmids . Typical 
PCR conditions are one cycle 95°C melting for two minutes; 

25 25 cycles 94°C denaturation for one minute, 50°C annealing 
for one minute and 72°C extension for one minute; plus one 
cycle 72°C extension for seven minutes. A Perkin Elmer 
GeneAmp PCR Core Reagents kit is used. A 100 ul reaction 
contains 100 pmole of each primer and one ug of template 

30 DNA; and lx PCR buffer, 200 uM dGTP, 200 uM dATP, 200 uM 

dTTP, 200 uM dCTP, 2.5 units AmpliTaq DNA polymerase and 2 
mM MgCl2- PCR reactions are performed in a Model 480 DNA 
thermal cycler (Perkin Elmer Corporation, Norwalk, CT) . 
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"Fragment Start" and "Fragment Stop", which have 
complementary sequence in the linker region and the coding 
sequence for the two amino acids on both sides of the 
linker, are joined together in a third PCR step to make the 
5 full-length gene encoding the new protein. The DNA 

fragments "Fragment Start" and "Fragment Stop" are resolved 
on a 1% TAE gel, stained with ethidium bromide and isolated 
using a Qiaex Gel Extraction kit (Qiagen) . These fragments 
are combined in equimolar quantities, heated at 70°C for ten 

10 minutes and slow cooled to allow annealing through their 

shared sequence in "linker start" and "linker stop". In the 
third PCR step, primers "new start" and "new stop" are added 
to the annealed fragments to create and amplify the full- 
length new N-terminus/C-terminus gene. Typical PCR 

15 conditions are one cycle 95°C melting for two minutes; 25 

cycles 94°C denaturation for one minute, 60°C annealing for 
one minute and 72°C extension for one minute; plus one cycle 
72°C extension for seven minutes. A Perkin Elmer GeneAmp 
PCR Core Reagents kit is used. A 100 ul reaction contains 

20 100 pmole of each primer and approximately 0.5 ug of DNA; 

and lx PCR buffer, 200 uM dGTP, 200 uM dATP, 200 uM dTTP, 
200 uM dCTP, 2.5 units AmpliTaq DNA polymerase and 2 mM 
MgCl2- PCR reactions are purified using a Wizard PCR Preps 
kit (Pr omega) . 

25 

Method II. Creation of genes with new N-terminus/C-terminus 
without a linker region. 

New N-terminus/C-terminus genes without a linker 
30 joining the original N-terminus and C- terminus can be made 
using two steps of PCR amplification and a blunt end 
ligation. The steps are illustrated in Figure 3. In the 
first step, the primer set ("new start" and "P-bl start") is 
used to create and amplify, from the original gene sequence, 
35 the DNA fragment ("Fragment Start") that contains the 
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sequence encoding the new N-terminal portion of the new 
protein. In the second step, the primer set ("new stop" and 
"P-bl stop") is used to create and amplify, from the 
original gene sequence, the DNA fragment ("Fragment Stop") 
5 that contains the sequence encoding the new C-terminal 

portion of the new protein. The "new start" and "new stop" 
primers are designed to include appropriate restriction 
sites which allow cloning of the new gene into expression 
vectors. Typical PCR conditions are one cycle 95°C melting 

10 for two minutes; 25 cycles 94°C denaturation for one minute, 
50°C annealing for 45 seconds and 72°C extension for 45 
seconds. Deep Vent polymerase (New England Biolabs) is used 
to reduce the occurrence of overhangs in conditions 
recommended by the manufacturer. The "P-bl start" and "P-bl 

15 stop" primers are phosphorylated at the 5 # end to aid in the 
subsequent blunt end ligation of "Fragment Start" and 
"Fragment Stop" to each other. A 100 ul reaction contained 
150 pmole of each primer and one ug of template DNA; and lx 
Vent buffer (New England Biolabs), 300 uM dGTP, 300 uM dATP, 

20 300 uM dTTP, 300 uM dCTP, and 1 unit Deep Vent polymerase. 

PCR reactions are performed in a Model 480 DNA thermal 
cycler (Perkin Elmer Corporation, Norwalk, CT) . PCR 
reaction products are purified using a Wizard PCR Preps kit 
(Promega) . 

25 

The primers are designed to include appropriate 
restriction enzyme recognition sites which allow for the 
cloning of the new gene into expression vectors. Typically 
"Fragment Start" is designed to create a Ncol restriction 

30 site , and M Fragment Stop" is designed to create a Hindlll 

restriction site. Restriction digest reactions are purified 
using a Magic DNA Clean-up System kit (Promega). Fragments 
Start and Stop are resolved on a 1% TAE gel, stained with 
ethidium bromide and isolated using a Qiaex Gel Extraction 

35 kit (Qiagen) . These fragments are combined with and 
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annealed to the ends of the - 3800 base pair Ncol/Hindlll 
vector fragment of pMON3934 by heating at 50°C for ten 
minutes and allowed to slow cool. The three fragments are 
ligated together using T4 DNA ligase (Boehringer Mannheim) . 
5 The result is a plasmid containing the full-length new N- 

terminus/C-terminus gene. A portion of the ligation reaction 
is used to transform E. coli strain DHSacells (Life 

Technologies, Gaithersburg, MD) . Plasmid DNA is purified 
and sequence confirmed as below. 

10 

Method III. Creation of new N-terminus/C-terminus genes by 
tandem-duplication method 

New N-terminus/C-terminus genes can be made based on 
15 the method described in R. A. Horlick, et al Protein Eng. 
5:427-431 (1992) . Polymerase chain reaction (PCR) 
amplif ication of the new N-terminus/C-terminus genes is 
performed using a tandemly duplicated template DNA. The 
steps are illustrated in Figure 4. 

20 

The tandemly-duplicated template DNA is created by 
cloning and contains two copies of the gene separated by DNA 
sequence encoding a linker connecting the original C- and N- 
terminal ends of the two copies of the gene. Specific 

25 primer sets are used to create and amplify a full-length new 
N terminus/C-terminus gene from the tandemly-duplicated 
template DNA. These primers are designed to include 
appropriate restriction sites which allow for the cloning of 
the new gene into expression vectors. Typical PCR conditions 

30 are one cycle 95°C melting for two minutes; 25 cycles 94°C 
denaturation for one minute, 50°C annealing for one minute 
and 72°C extension for one minute; plus one cycle 72°C 
extension for seven minutes. A Perkin Elmer GeneAmp PCR 
Core Reagents kit (Perkin Elmer Corporation, Norwalk, CT) is 

35 used. A 100 ul reaction contains 100 pmole of each primer 
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and one ug of template DNA; and Ix PCR buffer, 200 uM dGTP, 
200 uM dAT?, 200 uM dTTP, 200 uM dCTP, 2.5 units AmpliTaq 
DNA polymerase and 2 mM MgCl 2 . PCR reactions are performed 
in a Model 480 DNA thermal cycler (Perkin Elmer Corporation, 
5 Norwalk, CT) . PCR reactions are purified using a Wizard PCR 

Preps kit (Promega). 

DNA isolation and charartpyj Tifflf jft" 

10 Plasmid DNA can be isolated by a number of different 

methods and using commercially available kits known to those 
skilled in the art. A few such methods are shown herein. 
Plasmid DNA is isolated using the Promega Wizard™ Miniprep 
kit (Madison, WI) , the Qiagen QIAwell Plasmid isolation kits 

15 (Chatsworth, CA) or Qiagen Plasmid Midi kit. These kits 

follow the same general procedure for plasmid DNA isolation. 
Briefly, r..-.lls are pelleted by centrifugation (5000 x g) , 
plasmid DNA released with sequential NaOH/acid treatment, 
and cellular debris is removed by centrifugation (10000 x 

20 g) . The supernatant (containing the plasmid DNA) is loaded 
onto a column containing a DNA-binding resin, the column is 
washed, and plasmid DNA eluted with TE. After screening for 
the colonies with the plasmid of interest, the E. coli cells 
are inoculated into 50-100 mLs of LB plus appropriate 

25 antibiotic for overnight growth at 37°C in an air incubator 
while shaking. The purified plasmid DNA is used for DNA 
sequencing, further restriction enzyme digestion, additional 
subcloning of DNA fragments and transfection into mammalian, 
E. coli or other cells. 

30 

Sequence confirmation . 

Purified plasmid DNA is resuspended in dH 2 0 and 
quantitated by measuring the absorbance at 260/280 run in a 
35 Bausch and Loxnb Spectronic 601 UV spectrometer. DNA samples 
are sequenced using ABI PRISM™ DyeDeoxy™ terminator 



WO 97/12977 



57 



PCT/US96/15935 



sequencing chemistry (Applied Biosystems Division of Perkin 
Elmer Corporation, Lincoln City, CA) kits (Part Number 
401388 or 402078) according to the manufacturers suggested 
protocol usually modified by the addition of 5% DMSO to the 
5 sequencing mixture. Sequencing reactions are performed in a 
Model 480 DNA thermal cycler (Perkin Elmer Corporation, 
Norwalk, CT) following the recommended amplification 
conditions. Samples are purified to remove excess dye 
terminators with Centri-Sep™ spin columns (Princeton 

10 Separations, Adelphia, NJ) and lyophilized. Fluorescent dye 
labeled sequencing reactions are resuspended in deionized 
formamide, and sequenced on denaturing 4.75% polyacrylamide- 
8M urea gels using an ABI Model 373A automated DNA 
sequencer. Overlapping DNA sequence fragments are analyzed 

15 and assembled into master DNA contigs using Sequencher v2.1 
DNA analysis software (Gene Codes Corporation, Ann Arbor, 
MI) . 

Expression of G-CSF recentor agonists in mammalian cells 

20 

Mammalian Cell Transf ection/Production of Conditioned Media 

The BHK-21 cell line can be obtained from the ATCC 
(Rockville, MD) . The cells are cultured in Dulbecco's 

25 modified Eagle media (DMEM/high-glucose) , supplemented to 

2mM (mM) L-glutamine and 10% fetal bovine serum (FBS) . This 
formulation is designated BHK growth media. Selective media 
is BHK growth media supplemented with 453 units/mL 
hygromycin B (Calbiochem, San Diego, CA) . The BHK-21 cell 

30 line was previously stably transf ected with the HSV 

transactivating protein VP16, which transact ivates the IE110 
promoter found on the plasmid pMON3359 (See Hippenmeyer et 
al., Bio/Technology, pp. 1037-1041, 1993). The VP16 protein 
drives expression of genes inserted behind the IE110 

35 promoter. BHK-21 cells expressing the transactivating 
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protein VP«6 are designated BHK-VP16. The plasmid pMON1118 
(See Highkin et al. # Poultry Sci . , 70: 970-981, 1991) 
expresses the hygromycin resistance gene from the SV40 
promoter. A similar plasmid is available from ATCC, pSV2- 
5 hph. 

BHK-VP16 cells are seeded into a 60 millimeter (mm) 
tissue culture dish at 3 X 10 5 cells per dish 24 hours prior 
to transfection. Cells are transfected for 16 hours in 3 mL 
of "OPTIMEM*™ (Gibco-BRL, Gaithersburg, MD) containing 10 

10 ug of plasmid DNA containing the gene of interest, 3 ug 

hygromycin resistance plasmid, pMON1118, and 80 ug of Gibco- 
BRL "LI POFECTAMINE * ™ per dish. The media is subsequently 
aspirated and replaced with 3 mL of growth' media. At 48 
hours post-transfection, media from each dish is collected 

15 and assayed for activity (transient conditioned media) . The 
cells are removed from the dish by trypsin-EDTA, diluted 
1:10 and transferred to 100 mm tissue culture dishes 
containing 10 mL of selective media. After approximately 7 
days in selective media, resistant cells grow into colonies 

20 several millimeters in diameter. The colonies are removed 

from the dish with filter paper (cut to approximately the 
same size as the colonies and soaked in trypsin/EDTA) and 
transferred to individual wells of a 24 well plate 
containing 1 mL of selective media. After the clones are 

25 grown to confluence, the conditioned media is re-assayed, 
and positive clones are expanded into growth media. 

Expression of G-CSF receptor aooniftfg in E . roll 

30 E. coli strain MON105 or JM101 harboring the plasmid of 

interest are grown at 37°C in M9 plus casamino acids medium 
with shaking in a air incubator Model G25 from New Brunswick 
Scientific (Edison, New Jersey) . Growth is monitored at 
OD600 until it reaches a value of 1, at which time nalidixic 

35 acid (10 milligrams/mL) in 0.1 N NaOH is added to a final 
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concentration of 50 ^Lg/mL. The cultures are then shaken at 
37°C for three to four additional hours. A high degree of 
aeration is maintained throughout culture period in order to 
achieve maximal production of the desired gene product. The 
5 cells are examined under a light microsccpe for the presence 

of inclusion bodies (IB) . One mL aliquot s of the culture 
are removed for analysis of protein content by boiling the 
pelleted cells, treating them with reducing buffer and 
electrophoresis via SDS-PAGE (see Maniatis et al . Molecular 
10 Cloning: A Laboratory Manual, 1982). The culture is 
centrifuged (5000 x g) to pellet the cells. 

Inclusion Body preparation. Extraction. Refolding. Dialysis. 
DEAE Chromatography, and Characterization of the G-CSF 
15 receptor agonists which accumulate as inclusion bodies in £1 

CflLL 

Isolation of Inclusion Bodies: 

20 The cell pellet from a 330 mL E. colz culture is 

resuspended in 15 mL of sonication buffer (10 mM 2-amino-2- 
(hydroxymethyl) 1, 3 -propanediol hydrochloride (Tris-HCl), pH 
8.0 + 1 mM ethylenediaminetetraacetic acid (EDTA) ) . These 
resuspended cells are sonicated using the microtip probe of 

25 a Sonicator Cell Disruptor (Model W-375, Heat Systems- 
Ultrasonics, Inc., Farmingdale, New York). Three rounds of 
sonication in sonication buffer followed by centrif ugation 
are employed to disrupt the cells and wash the inclusion 
bodies (IB) . The first round of sonication is a 3 minute 

30 burst followed by a 1 minute burst, and the final two rounds 
of sonication are for 1 minute each. 

Extraction and refolding of proteins from inclusion body 
pellets : 
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Folloving the final centrifugation step, the IB pellet 
is resuspended in 10 mL of 50 mM Tris-HCl, pH 9.5, 8 M urea 
and 5 mM dithiothreitol (DTT) and stirred at room 
temperature for approximately 45 minutes to allow for 
5 denaturation of the expressed protein. 

The extraction solution is transferred to a beaker 
containing 7 0 mL of 5mM Tris-HCl, pH 9.5 and 2.3 M urea and 
gently stirred while exposed to air at 4°C for 18 to 48 
hours to allow the proteins to refold. Refolding is 

10 monitored by analysis on a Vydac (Hesperia, Ca.) C18 

reversed phase high pressure liquid chromatography (RP-HPLC) 
column (0.46x25 cm). A linear gradient of 40% to 65% 
acetonitrile, containing 0.1% trif luoroacetic acid (TFA) , is 
employed to monitor the refold. This gradient is developed 

15 over 30 minutes at a flow rate of 1.5 mL per minute. 

Denatured proteins generally elute later in the gradient 
than the refolded proteins . 

Purification: 

20 

Following the refold, contaminating E. coli proteins 
are removed by acid precipitation. The pH of the refold 
solution is titrated to between pH 5.0 and pH 5.2 using 15% 
(v/v) acetic acid (HOAc) . This solution is stirred at 4°C 

25 for 2 hours and then centrifuged for 20 minutes at 12,000 x 
g to pellet any insoluble protein. 

The supernatant from the acid precipitation step is 
dialyzed using a Spectra/ Por 3 membrane with a molecular 
weight cut off (MWCO) of 3,500 daltons. The dialysis is 

30 against 2 changes of 4 liters (a 50-fold excess) of lOmM 

Tris-HCl, p.H 8.0 for a total of 18 hours. Dialysis lowers 
the sample conductivity and removes urea prior to DEAE 
chromatography. The sample is then centrifuged (20 minutes 
at 12,000 x g) to pellet any insoluble protein following 

35 dialysis. 
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A Bio-Rad Bio-Scale DEAE2 column (7 x 52 mm) is used 
for ion exchange chromatography. The column is equilibrated 
in a buffer containing lOmM Tris-HCl, pH 8.0. The protein is 
eluted using a 0-to-500 mM sodium chloride (NaCl) gradient, 
5 in equilibration buffer, over 45 column volumes. A flow 

rate of 1 mL per minute is used throughouc the run. Column 
fractions (2 mL per fraction) are collected across the 
gradient and analyzed by RP HPLC on a Vydac (Hesperia, Ca.) 
C18 column (0.46 x 25 cm). A linear gradient of 40% to 65% 

10 acetonitrile, containing 0,1% trif luoroacetic acid (TFA) , is 
employed. This gradient is developed over 30 minutes at a 
flow rate of 1.5 mL per minute. Pooled fractions are then 
dialyzed against 2 changes of 4 liters (50-to-500-fold 
excess) of 10 mM ammonium acetate ( NH4AC ) , pH 4.0 for a 

15 total of 18 hours. Dialysis is performed using a 

Spectra/Por 3 membrane with a MWCO of 3,500 daltons. 
Finally, the sample is sterile filtered using a 0.22Mm 
syringe filter (llStar LB syringe filter, Costar, Cambridge, 
Ma.), and stored at 4°C. 

20 In some cases the folded proteins can be affinity 

purified using affinity reagents such as mAbs or receptor 
subunits attached to a suitable matrix. Alternatively, (or 
in addition) purification can be accomplished using any of a 
variety of chromatographic methods such as: ion exchange, 

25 gel filtration or hydrophobic chromatography or reversed 
phase HPLC. 

These and other protein purification methods are 
described in detail in Methods in Enzymology, Volume 182 
30 'Guide to Protein Purification 1 edited by Murray Deutscher, 
Academic Press, San Diego, CA (1990) . 

Protein Characterization: 
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The purified protein is analyzed by RP-HPLC, 
electrospray mass spectrometry, and SDS-PAGE. The protein 
quantitation is done by amino acid composition, RP-HPLC, and 
Bradford protein determination. In some cases tryptic 
5 peptide mapping is performed in conjunction with 

electrospray mass spectrometry to confirm the identity of 
the protein. 

AML Proliferation A^y 

10 The factor-dependent cell line AML 193 was obtained 

from the American Type Culture Collection (ATCC, Rockville, 
MD) . This cell line, established from a patient with acute 
myelogenous leukemia, is a growth factor dependent cell line 
which displayed enhanced growth in GM-CSF supplemented 

15 medium (Lange, B . , et al., Blood 70: 192, 1987; Valtieri, 

M., et al., J . Immunol. 138:4042, 1987). The ability of AML 
193 cells to proliferate in the presence of human IL-3 has 
also been documented. (Santoli, D., et al . , J. Immunol. 
139: 348, 1987). A cell line variant was used, AML 193 1.3, 

20 which was adapted for long term growth in IL-3 by washing 
out the growth factors and starving the cytokine dependent 
AML 193 cells for growth factors for 24 hours. The cells 
are then replated at 1x105 cells/well in a 24 well plate in 
media containing 100 U/mL IL-3. It took approximately 2 

25 months for the cells to grow rapidly in IL-3. These cells 
are maintained as AML 193 1.3 thereafter by supplementing 
tissue culture medium (see below) with human IL-3. 

AML 193 1.3 cells are washed 6 times in cold Hanks 
balanced salt solution (HBSS, Gibco, Grand Island, NY) by 

30 centrifuging cell suspensions at 250 x g for 10 minutes 

followed by decantation of the supernatant. Pelleted cells 
are resuspended in HBSS and the procedure is repeated until 
six wash cycles are completed. Cells washed six times by 
this procedure are resuspended in tissue culture medium at a 

35 density ranging from 2 x 105 to 5 x 10* viable cells/mL. 
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• This medium is prepared by supplementing Iscove's modified 
Dulbecco's Medium (IMDM, Hazel ton, Lenexa, KS) with albumin, 
transferrin, lipids and 2-mercaptoethanol . Bovine albumin 
(Boehringer-Mannheim, Indianapolis, IN) is added at 500 
5 ^g/mL; human transferrin (Boehringer-Mannheim, Indianapolis, 
IN) is added at 100 |ig/mL; soybean lipid (Boehringer- 
Mannheim, Indianapolis, IN) is added at 50 ^g/mL; and 2- 
mercaptoethanol (Sigma, St. Louis, MO) is added at 5 x 10" 5 
M. 

10 Serial dilutions of G-CSF receptor agonist proteins 

are made in triplicate series in tissue culture medium 
supplemented as stated above in 96 well Costar 3596 tissue 
culture plates. Each well contained 50 >il of medium 
containing G-CSF receptor agonist proteins once serial 

15 dilutions are completed. Control wells contained tissue 

culture medium alone (negative control). AML 193 1.3 cell 
suspensions prepared as above are added to each well by 
pipetting 50 \il (2.5 x 104 cells) into each well. Tissue 
culture plates are incubated at 37°C with 5% C02 in 

20 humidified air for 3 days. On day 3, 0.5 |iCi ^H-thymidine 

(2 Ci/mM, New England Nuclear, Boston, MA) is added in 50 \il 
of tissue culture medium. Cultures are incubated at 37°C 
with 5% CO2 in humidified air for 18-24 hours. Cellular DNA 
is harvested onto glass filter mats (Pharmacia LKB, 

25 Gaithersburg, MD) using a TOMTEC cell harvester (TOMTEC, 

Orange, CT) which utilized a water wash cycle followed by a 
70% ethanol wash cycle. Filter mats are allowed to air dry 
and then placed into sample bags to which scintillation 
fluid (Scintiverse II, Fisher Scientific, St. Louis, MO or 

30 BetaPlate Scintillation Fluid, Pharmacia LKB, Gaithersburg, 
MD) is added. Beta emissions of samples from individual 
tissue culture wells are counted in a LKB BetaPlate model 
1205 scintillation counter (Pharmacia LKB, Gaithersburg, MD) 
and data is expressed as counts per minute of 3H-thymidine 

35 incorporated into cells from each tissue culture well. 
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Activity of each G-CSF receptor agonist proteins preparation 
is quant itated by measuring cell proliferation ( ^H-thymidine 
incorporation) induced by graded concentrations of G-CSF 
receptor agonist. Typically, concentration ranges from 0.05 
5 pM - 105 pM are quantitated in these assays. Activity is 
determined by measuring the dose of G-CSF receptor agonist 
protein which provides 50% of maximal proliferation (EC50 = 
0.5 x (maximum average counts per minute of 3 H-thymidine 
incorporated per well among triplicate cultures of all 

10 concentrations of G-CSF receptor agonists tested - 
background proliferation measured by 3H-thymidine 
incorporation observed in triplicate cultures lacking any 
factor) . This EC50 value is also equivalent to 1 unit of 
bioactivity. Every assay is performed with native 

15 interleukin-3 and G-CSF as reference standards so that 
relative activity levels could be assigned. 

Typically, the G-CSF receptor agonist proteins were 
tested in a concentration range of 2000 pM to 0.06 pM 
titrated in serial 2 fold dilutions. 

20 Activity for each sample was determined by the 

concentration which gave 50% of the maximal response by 
fitting a four-parameter logistic model to the data. It was 
observed that the upper plateau (maximal response) for the 
sample and the standard with which it was compared did not 

25 differ. Therefore relative potency calculation for each 

sample was determined from EC50 estimations for the sample 
and the standard as indicated above. 

Other in vitro cell based nrolif eraf ion assay s 

30 

Other in vitro cell based proliferation assays, known 
to those skilled in the art, may also be useful to determine 
the activity of the G-CSF receptor agonists in a similar 
manner as described in the AML 193.1.3 cell proliferation 
35 assay. 
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Transfected cell lines : 

Cell lines, such as BHK or the murine pro B cell line 
Baf/3, can be transfected with a colony stimulating factor 
5 receptor, .*uch as the human G-CSF receptor which the cell 

line does not have. These transfected cell lines can be used 
to determine the activity of the ligand of which the 
receptor has been transfected. 

10 

EXAMPLE 1 

Construction of nMON3485 

15 The new N-terminus/C-terminus gene in pMON3485 was 

created using Method I as described in Materials and 
Methods. Fragment Start was created and amplified from G- 
CSF Ser* 7 sequence in pMON13037 using the primer set, 39 
start (SEQ ID NO:7) and L-ll start (SEQ ID NO:3) . Fragment 

20 Stop was created and amplified from G-CSF Ser 17 sequence in 
the plasmid, pMON13037 (WO 95/21254), using the primer set, 
38 stop (SEQ ID NO: 8) and L-ll stop (SEQ ID NO:4) . The 
full-length new N terminus/C-terminus G-CSF Ser 17 gene was 
created and amplified from the annealed Fragments Start and 

25 Stop using the primers 39 start (SEQ ID NO: 7) and 38 stop 
(SEQ ID NO:8) . 

The resulting DNA fragment which contains the new gene 
was digested with restriction endonucleases Ncol and Hindlll 
and purified using a Magic DNA Clean-up System kit (Promega, 

30 Madison, WI) . The plasmid, pMON3934 (derivative of 

PMON3359), was digested with restriction endonucleases 
Hindlll and Ncol, resulting in an approximately 3800 base 
pair vector fragment, and gel-purified. The purified 
restriction fragments were combined and ligated using T4 DNA 

35 ligase. A portion of the ligation reaction was used to 
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transform £. coli strain DH5cc cells (Life Technologies, 
Gaithersburg, MD) . Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and 
sequenced to confirm the correct insert. The resulting 
5 plasmid was designated pMON3485. 

BHK cells were transfected with the plasmid, pMON3485, 
for protein expression and bioassay. 

The plasmid, pMON3485 containing the gene sequence of 
10 (SEQ ID NO:25), encodes the following amino acid sequence: 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 
Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 
Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu 
Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 
Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
Ala Leu. Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 
Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 
Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser 
Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin 
Glu Lys Leu Cys Ala Thr (SEQ ID NO: 43) 

25 EXAMPLE 2 

Construction of PMQN3486 

The new N-terminus/C- terminus gene in pMON3486 was 
30 created using Method I as described in Materials and 

Methods. Fragment Start was created and amplified from G- 
CSF Ser 17 sequence in the plasmid, pMON13037, using the 
primer set, 97 start (SEQ ID NO: 9) and L-ll start (SEQ ID 
NO:3). Fragment Stop was created and amplified from G-CSF 
35 Ser 17 sequence in.pMON13037 using the primer set, 96 stop 



15 



20 
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(SEQ ID NO:10) and L-ll stop (SEQ ID NO:4). The full-length 
new N terminus /C-terminus G-CSF Ser 17 gene was created and 
amplified from the annealed Fragments Start and Stop using 
the primers 97 start (SEQ ID NO: 9) and 96 stop (SEQ ID 
5 NO:10>. 

The resulting DNA fragment which contains the new gene 
was digested with restriction endonucleases Ncol and Hindlll 
and gel-purified using a Magic DNA Clean-up System kit. The 
plasmid, pMON3934, was digested with restriction 
10 endonucleases Hindlll and Ncol, resulting in an 

approximately 3800 base pair vector fragment, and gel- 
purified. The purified restriction fragments were combined 
and ligated using T4 DNA ligase. A portion of the ligation 
reaction was used to transform B. coii strain DH5a cells. 

15 Transformant bacteria were selected on ampicillin-containing 
plates. Plasmid DNA was isolated and sequenced to confirm 
the correct insert. The resulting plasmid was designated 
PMON3486. 

BHK cells were transfected with the plasmid, pMON3486, 
20 for protein expression and bioassay. 



The plasmid, pMON3486 containing the gene sequence of 



(SEQ ID 


NO:26) , 


encodes 


the 


following amino 


acid sequence: 


Pro 


Glu 


Leu Gly 


Pro Thr 


Leu 


Asp Thr Leu Gin 


Leu Asp Val Ala 


Asp 


Phe 


Ala 


Thr 


Thr He 


Trp 


Gin Gin Met Glu 


Glu Leu Gly Met 


Ala 


Pro 


Ala 


Leu 


Gin Pro 


Thr 


Gin Gly Ala Met 


Pro Ala Phe Ala 


Ser 


Ala 


Phe 


Gin 


Arg Arg 


Ala 


Gly Gly Val Leu 


Val Ala Ser His 


Leu 


Gin 


Ser 


Phe 


Leu Glu 


Val 


Ser Tyr Arg Val 


Leu Arg His Leu 


Ala 


Gin 


Pro 


Ser 


Gly Gly 


Ser 


Gly Gly Ser Gin 


Ser Phe Leu Leu 


Lys 


Ser 


Leu 


Glu 


Gin Val 


Arg 


Lys He Gin Gly 


Asp Gly Ala Ala 


Leu 


Gin 


Glu 


Lys 


Leu Cys 


Ala 


Thr Tyr Lys Leu 


Cys His Pro Glu 


Glu 


Leu 


Val 


Leu 


Leu Gly 


His 


Ser Leu Gly He 


Pro Trp Ala Pro 


Leu 


Ser 


Ser 


Cys 


Pro Ser 


Gin 


Ala Leu Gin Leu 


Ala Gly Cys Leu 



WO 97/12977 PCT/US96/1 5935 

68 

Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
Ala Leu Glu Gly He Ser (SEQ ID NO: 44) 

EXAMPLE 3 

5 

Construction of PMQN3487 

The new N-terminus/C-terminus gene in pMON3487 was 
created using Method I as described in Materials and 

10 Methods. Fragment Start was created and amplified from G- 
CSF Ser 17 sequence in the plasmid, pMON13037, using the 
primer set, 126 start (SEQ ID NO: 11) and L-ll start (SEQ ID 
NO:3). Fragment Stop was created and amplified from G-CSF 
Ser 17 sequence in pMON13037 using the primer set, 125 stop 

15 (SEQ ID NO:12) and L-ll stop (SEQ ID NO:4) . The full-length 

new N terminus/C-terminus G-CSF Ser 17 gene was created and 
amplified from the annealed Fragments Start and Stop using 
the primers 126 start (SEQ ID NO:ll) and 125 stop (SEQ ID 
NO:12) . 

20 The resulting DNA fragment which contains the new gene 

was digested with restriction endonucleases Ncol and Hindlll 
and purified using a Magic DNA Clean-up System kit. The 
plasmid, pMON3934, was digested with restriction 
endonucleases Hindlll and Ncol, resulting in an 

25 approximately 3800 base pair vector fragment, and gel- 

purified. The purified restriction fragments were combined 
and ligated using T4 DNA ligase. A portion of the ligation 
reaction was used to transform E. coli strain DH5ct cells. 

Transf ormant bacteria were selected on ampicillin-containing 
30 plates. Plasmid DNA was isolated and sequenced to confirm 
the correct insert. The resulting plasmid was designated 
PMON3487 . 

BHK cells were transf ected with the plasmid, pMON3487, 
for protein expression and bioassay. 

35 
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The piasmid, pMON3487 containing the gene sequence of 
(SEQ ID NO: 27 ) , encodes the following amino acid sequence: 



Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Ala 


Gly Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 


Ser 


Gly 


Gly 


Ser 


Gly Gly 


Ser 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys He 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Leu 


Leu 


Gly 


His 


Ser Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


Pro 


Glu Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


(SEQ ID 


NO: 45) 













EXAMPLE 4 



Ponst-ruction of nMON3488 

20 

The new N-terminus/C-terminus gene in pMON3488 was 
created using Method I as described in Materials and 
Methods, Fragment Start was created and amplified from G- 
CSF Ser 17 sequence in the piasmid, pMON13037, using the 

25 primer set, 133 start (SEQ ID NO: 13) and L-ll start (SEQ ID 
NO: 3) . Fragment Stop was created and amplified from G-CSF 
Ser 17 sequence in the piasmid, pMON13037 using the primer 
set, 132 stop (SEQ ID NO:14) and L-ll stop (SEQ ID NO:4) . 
The full-length new N terminus /C- terminus G-CSF Ser 17 gene 

30 was created and amplified from the annealed Fragments Start 
and Stop using the primers 133 start (SEQ ID NO: 13) and 132 
stop (SEQ ID NO: 14) . 

The resulting DNA fragment which contains the new gene 
was dig sted with restriction endonucleases Ncol and Hindlll 

35 and purified using a Magic DNA Clean-up System kit. The 
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plasmid, pMON3934, was digested with restriction 
endonucleases Hindlll and Ncol, resulting in an 
approximately 3800 base pair vector fragment, and gel- 
purified. The purified restriction fragments were combined 
5 and ligated using T4 DNA ligase. A portion of the ligation 
reaction was used to transform E. coii strain DH5ocells. 
Transformdnt bacteria were selected on ampicillin-containing 
plates. Plasmid DNA was isolated and sequenced to confirm 
the correct insert. The resulting plasmid was designated 
10 pMON3488. 

BHK cells were transfected with the plasmid, pMON3488, 
for protein expression and bioassay. 

The plasmid, pMON3488 containing the gene sequence of 
15 (SEQ ID NO:28), encodes the following amino acid sequence: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 
Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 
Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Ser Gly Gly 
Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys 
Ala Thr iyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly 
His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
Ala Pro Ala Leu Gin Pro (SEQ ID NO: 46) 

30 EXAMPLE 5 

Construction of PMQN3489 



20 



25 



35 



The new N- terminus /C- terminus gene in pMON3489 was 
created using Method I as described in Materials and 
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Methods. Fragment Start was created and amplified from G- 
CSF Ser 17 sequence in the piasmid, pMON13037, using the 
primer set, 142 start (SEQ ID NO:15) and L-ll start (SEQ ID 
NO: 3) . Fragment Stop was created and amplified from G-CSF 
5 Ser 17 sequence in pMON13037 using the primer set, 141 stop 

(SEQ ID NO:16) and L-ll stop (SEQ ID NO:4) . The full-length 
new N terminus/C-terminus G-CSF Ser 17 gene was created and 
amplified from the annealed Fragments Start and Stop using 
the primers 142 start (SEQ ID N0:15) and 141 stop (SEQ ID 
10 NO:16) . 

The resulting DNA fragment which contains the new gene 
was digested with restriction endonucleases Ncol and Hindlll 
and purified using a Magic DNA Clean-up System kit. The 
piasmid, pMON3934, was digested with restriction 
15 endonucleases Hindlll and Ncol, resulting in an 

approximately 3800 base pair vector fragment, and gel- 
purified. The purified restriction fragments were combined 
and ligated using T4 DNA ligase. A portion of the ligation 
reaction was used to transform E. coli strain DHSccells. 

20 Transformant bacteria were selected on ampicillin-containing 
plates. Piasmid DNA was isolated and sequenced to confirm 
the correct insert. The resulting piasmid was designated 
pMON3489. 

BHK cells were transfected with the piasmid, pMON3489, 
25 for protein expression and bioassay. 

The piasmid, pMON3489 containing the gene sequence of 
(SEQ ID NO:29), encodes the following amino acid sequence: 

30 Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
Ala Gin Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu 
Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu 

35 Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro 
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Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


He Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu Gly Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


(SEQ ID 


NO: 47) 













EXAMPLE 

10 

Construction of r>MQN3 4 Q Q 

The new N-terminus/C-terminus gene in pMON3490 was 
created using Method II as described in Materials and 

15 Methods. Fragment Start was created and amplified from G- 
CSF sequence in the plasmid, pMON13037, using the primer 
set, 39 start (SEQ ID NO:7) and P-bl start (SEQ ID NO:5) . 
Fragment Stop was created and amplified from G-CSF Ser 17 
sequence in pMON13037 using the primer set, 38 stop (SEQ ID 

20 NO: 8) and P-bl stop (SEQ ID NO: 6) . Fragment Start was 

digested with restriction endonuclease Ncol, and Fragment 
Stop was digested with restriction endonuclease Hindlll. 
After purification, the digested Fragments Start and Stop 
were combined with and ligated to the approximately 3800 

25 base pair Ncol-Hindlll vector fragment of pMON3934. 

Transformant bacteria were selected on ampicillin-containing 
plates. Plasmid DNA was isolated and sequenced to confirm 
the correct insert. The resulting plasmid was designated 
PMON3490. 

30 BHK cells were transfected with the plasmid, pMON3490, 

for protein expression and bioassay. 

The plasmid, pMON3490 containing the gene sequence of 
(SEQ ID NO:30), encodes the following amino acid sequence: 
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Tyr Lys Leu Cys His Pro Glu Glu 
Leu Gly lie Pro Trp Ala Pro Leu 
Leu Gin Leu Ala Gly Cys Leu Ser 
Leu Tyr Gin Gly Leu Leu Gin Ala 
5 Leu Gly Pro Thr Leu Asp Thr Leu 

Ala Thr Thr He Trp Gin Gin Met 
Ala Leu Gin Pro Thr Gin Gly Ala 
Phe Gin Arg Arg Ala Gly Gly Val 
Ser Phe Leu Glu Val Ser Tyr Arg 
10 Pro Thr Pro Leu Gly Pro Ala Ser 
Leu Lys Ser Leu Glu Gin Val Arg 
Ala Leu Gin Glu Lys Leu Cys Ala 



Leu 


Val Leu 


Leu 


Gly 


His 


Ser 


Ser 


Ser Cys 


Pro 


Ser 


Gin 


Ala 


Gin 


Leu His 


Ser 


Gly 


Leu 


Phe 


Leu 


Glu Gly 


He 


Ser 


Pro 


Glu 


Gin 


Leu Asp 


Val 


Ala 


Asp 


Phe 


Glu 


Glu Leu Gly 


Met 


Ala 


Pro 


Met 


Pro Ala 


Phe 


Ala 


Ser 


Ala 


Leu 


Val Ala 


Ser 


His 


Leu 


Gin 


Val 


Leu Arg 


His 


Leu 


Ala 


Gin 


Ser 


Leu Pro 


Gin 


Ser 


Phe 


Leu 


Lys 


He Gin Gly 


Asp 


Gly Ala 


Thr 


(SEQ ID 


NO: 48) 







EXAMPLE 7 

15 

Construction of PMON3491 



The new N-terminus/C-terminus gene in pMON3491 was 
created using Method II as described in Materials and 

20 Methods. Fragment Start was created and amplified from G- 
CSF sequence in the plasmid, pMON13037, using the primer 
set, 97 start (SEQ ID NO:9) and P-bl start (SEQ ID NO:5) . 
Fragment Stop was created and amplified from G-CSF Ser 17 
sequence in pMON13037 using the primer set, 96 stop (SEQ ID 

25 NO:10) and P-bl stop (SEQ ID NO:6) . Fragment Start was 

digested with restriction endonuclease Ncol, and Fragment 
Stop was digested with restriction endonuclease Hindlll. 
After purification, the digested Fragments Start and Stop 
were combined with and ligated to the approximately 3800 

30 base pair Ncol-Hindlll vector fragment of pMON3934. A 

portion of the ligation reaction was used to transform E. 
coli strain DH5a cells. Transformant baicteria were 

selected on ampicillin-containing plates. Plasmid DNA was 
isolated and s quenced to confirm th correct insert. The 
35 resulting plasmid was designated pMON3491. 
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BHK cells were transfected with the plasmid, pMON3491, 
for protein expression and bioassay. 

The plasmid, pMON3491 containing the gene sequence of 
5 (SEQ ID N0:31), encodes the following amino acid sequence: 



Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Leu 


Leu 


Gly 


His 


Ser 


Leu Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Le.j 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


(SEQ ID 


NO:49) 







20 EXAMPLE 8 

Construction of PMON34 92 



The new N-terminus/C-terminus gene in pMON3492 was 
25 created using Method II as described in Materials and 

Methods. Fragment Start was created and amplified from G- 
CSF sequence in the plasmid, pMON13037, using the primer 
set, 126 start (SEQ ID NO:ll) and P-bl start (SEQ ID NO:5). 
Fragment Stop was created and amplified from G-CSF Ser 17 
30 sequence in pMON13037 using the primer set, 125 stop (SEQ ID 

NO:12) and P-bl stop (SEQ ID NO:6) . Fragment Start was 
digested w.'.th restriction endonuclease Ncol, and Fragment 
Stop was digested with restriction endonuclease Hindlll. 
After purification, the digested Fragments Start and Stop 
35 were combined with and ligated to the approximately 3800 
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base pair Ncol-Hindlll vector fragment of pMON3934. A 
portion of the ligation reaction was used to transform E. 
coli strain DH5ct cells. Transformant bacteria were 

selected on ampicillin-containing plates. Plasmid DNA was 
5 isolated and sequenced to confirm the correct insert. The 
resulting plasmid was designated pMON3492. 

BHK cells were transfected with the plasmid, pMON3492, 
for protein expression and bioassay. 

10 The plasmid, pMON3492 containing the gene sequence of 

(SEQ ID NO:32), encodes the following amino acid sequence: 



Met 


Ala 


Pro 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe 


Gin 


Arg 


Arg 


Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


Gin Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Leu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


(SEQ ID 


NO:50) 







25 

EXAMPLE 9 

Construction of PMQN3493 

30 

The new N-terminus/C-terminus gene in pMON3493 was 
created using Method II as described in Materials and 
Methods. Fragment Start was created and amplified from G- 
CSF sequence in the plasmid, pMON13037, using the primer 
35 set, 133 start (SEQ ID NO:13) and P-bl start (SEQ ID NO:5). 
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Fragment Stop was created and amplified from G-CSF Ser 17 
sequence in pMON13037 using the primer set, 132 stop (SEQ ID 
NO: 14) and P-bl stop (SEQ ID NO: 6). Fragment Start was 
digested with restriction endonuclease Ncol, and Fragment 
5 Stop was digested with restriction endonuclease Hindlll. 

After purification, the digested Fragments Start and Stop 
were combined with and ligated to the approximately 3800 
base pair Ncol-Hindlll vector fragment of pMON3934. A 
portion of the ligation reaction was used to transform E. 

10 coli strain DH5a cells. Transformant bacteria were 

selected on ampicillin-containing plates. Plasmid DNA was 
isolated and sequenced to confirm the correct insert. The 
resulting plasmid was designated pMON3493. 

BHK cells were transfected with the plasmid, pMON3493, 

15 for protei i expression and bioassay. 

The plasmid, pMON3493 containing the gene sequence of 
(SEQ ID NO:33), encodes the following amino acid sequence: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 
Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 
Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu 
Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 
Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser 
Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 
His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu 
Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu 
Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu 
Leu Gly Met Ala Pro Ala Leu Gin Pro (SEQ ID NO: 51) 

EXAMPLE 10 
35 Construction of PMON3494 



20 



25 



30 
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The new N-terminus/C-terminus gene in pMON3494 was 
created using Method II as described in Materials and 
Methods. Fragment Start was created and amplified from G- 
5 CSF sequence in the plasmid, pMON13037, using the primer 

set, 142 start (SEQ ID N0:15) and P-bl start (SEQ ID NO:5). 
Fragment Stop was created and amplified from G-CSF Ser 17 
sequence i > pMON13037 using the primer set, 141 stop { SEQ ID 
NO: 16) and P-bl stop (SEQ ID N0:6). Fragment Start was 

10 digested with restriction endonuclease Ncol, and Fragment 

Stop was digested with restriction endonuclease Hindlll. 
After purification, the digested Fragments Start and Stop 
were combined with and ligated to the approximately 3800 
base pair Ncol-Hindlll vector fragment of pMON3934. A 

15 portion of the ligation reaction was used to transform E. 
coli strain DH5a cells. Transformant bacteria were 

selected on ampicillin-containing plates. Plasmid DNA was 
isolated and sequenced to confirm the correct insert. The 
resulting plasmid was designated pMON3494. 
20 BHK cells were transfected with the plasmid, pMON3494, 

for protein expression and bioassay. 



The pxasmid, pMON3494 containing the gene sequence of 



25 



(SEQ ID 


NO: 34) , 


encodes 


the 


following amino 


acid sequence: 


Ser 


Ala 


Phe 


Gin 


Arg Arg 


Ala 


Gly Gly Val 


Leu 


Val Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


Leu 


Glu 


Val 


Ser Tyr Arg 


Val 


Leu Arg 


His 


Leu 


Ala 


Gin 


Pro 


Thr 


Pro 


Leu 


Gly 


Pro Ala Ser 


Ser 


Leu Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gin Val Arg 


Lys 


He Gin 


Gly 


Asp 


Gly Ala 


Ala 


Leu 


Gin 


Glu 


Lys 


Leu Cys Ala 


Thr 


Tyr Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


Leu 


Leu Gly His 


Ser 


Leu Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


Pro Ser Gin 


Ala 


Leu Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser Gly Leu 


Phe 


Leu Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


lie Ser Pro 


Glu 


Leu Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val Ala Asp 


Phe 


Ala Thr 


Thr 


He 
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Trp Gin G.l:i Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro 
Thr Gin Gly Ala Met Pro Ala Phe Ala (SEQ ID NO: 52) 

Examples 11-3 0 

5 

The genes encoding the G-CSF receptor agonists of 
Examples 1-10 were excised from the BHK vectors as a 
Ncol/Hindlll fragment and ligated with the ~ 3630 base pair 
Ncol/Hindlll vector fragment of pMON2341 (WO 94/12638). The 

10 resulting plasmids (Examples 11-20) are indicated in Table 
4. The plasmids were transformed into E. coli strain JM101 
cells and expression of the G-CSF receptor agonist protein 
was evaluated. The proteins expressed are the same as those 
expressed in the parental BHK expression vector except the 

15 proteins were immediately preceded by a Methionine-Alanine 
dipeptide and the Methionine is processed off by methionine 
aminopeptidase. Overnight growths of cells (20 Klett units) 
were inoculated in lOmL of minimal M9 medium supplemented 
with vitamin Bl and trace minerals and incubated with 

20 shaking at 37°C until initial Klett readings of -120 units 
were obtained. At 120 Klett units 50uL of lOmg/mL nalidixic 
acid was added. Four hours post -induct ion, a 1ml aliquot was 
removed for protein expression analysis by SDS-PAGE. Cells 
were also examined using light microscopy for the presence 

25 of inclusion bodies. Only pMON3450 and pMON3455 had 

significant expression levels of the G-CSF receptor agonist 
protein. In an effort to improve expression levels of G-CSF 
receptor agonists, the 5' end of the genes were re- 
engineered to incorporate AT-rich codons and E. coli 

30 preferred codons between the unique Ncol and Nhel 

restriction endonuclease recognition sites (Examples 21-28) . 
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TABLE 4 







E. coli 


expression 


plasmids 








Resulting 






Parental 


Example 


# 


E. coli 


Breakpoint 


Linker 


BHK plasmi 






expression 






pM0N# 






plasmid 












pMON# 








Example 


11 


PMON3450 


38/39 


zero 


PMON3490 


Example 


12 


PMON3455 


38/39 


Al-10 


PMON3485 


Example 


13 


PMON3451 


96/97 


zero 


PMON3491 


Example 


14 


PMON3456 


96/97 


Al-10 


PMON3486 


Example 


15 


PMON3452 


125/126 


zero 


PMON3492 


Example 


16 


PMON3457 


125/126 


Al-10 


PMON3487 


Example 


17 


PMON3453 


132/133 


zero 


PMON3493 


Example 


18 


PMON3458 


132/133 


Al-10 


PMON3488 


Example 


19 


PMON3454 


141/142 


zero 


PMON3494 


Example 


20 


PMON3459 


141/142 


Al-10 


PMON3489 



5 

Example 21 

10 Construction of PMON25184 

The complementary pair of synthetic oligomers, 
141for.seq (SEQ ID NO:23) and 141rev.seq (SEQ ID NO:24) , 
(Midland Certified Reagent Co., Midland TX) were annealed by 

15 heating 2ug of each synthetic oligomer in a 20ul reaction 
mixture containing 20mM Tris-HCl (7.5), lOmM MgCl2 , and 
50mM NaCl, at 80°C for 5 minutes, and allowing the mixture 
to slowly cool to ambient temperature (approximately 45 
minutes) . When properly annealed the oligomers create an 

20 Ncol site at the 5' end and a Nhel site at the 3' end. 
Approximately 15 ng of the annealed oligomer pair was 
ligated with the gel-purified - 4120 base pair Ncol/Nhel 
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vector frajment of pMON3454 (-molar ratio of 10:1). The 
resulting gene, had seven codon changes at the 5* end of the 
gene. The ligation reaction was used to transform E. coli 
strain DH5a and the desired codon changes were confirmed by 
5 DNA sequence analysis. The resulting plasmid was designated 
pMON25184. Plasmid, pMON25184 containing the gene sequence 
of (SEQ ID 1*0:38), DNA was retransf ormed into E. coli strain 
JM101 cells for protein expression. The protein expressed is 
the same as that expressed from pMON3454. 

10 

Example 22 

Construction of PMQN25183 

15 The complementary pair of synthetic oligomers, 

141for.seq (SEQ ID NO:23) and 141rev.seq (SEQ ID NO:24), 
(Midland Certified Reagent Co., Midland TX) were annealed by 
heating 2ug of each synthetic oligomer in a 20ul reaction 
mixture containing 20mM Tris-HCl (7.5), lOmM MgCl2 * and 

20 50mM NaCl, at 80°C for 5 minutes, and allowing the mixture 
to slowly cool to ambient temperature (approximately 45 
minutes) . When properly annealed the oligomers create an 
Ncol site at the 5' end and a Nhel site at the 3' end. 
Approximately 15ng of the annealed oligomer pair was ligated 

25 with the - 4110 base pair Ncol/Nhel gel-purified pMON3459 
(-molar ratio of 10:1) . The ligation mixture was used to 
transform E. coli strain DH5a and the desired codon changes 

were confirmed by DNA sequence analysis. The resulting 
plasmid was designated pMON25188. The resulting gene, had 
30 seven codon changes at the 5' end of the gene. Plasmid, 

PMON25188 containing the gene sequence of (SEQ ID NO:42), 
DNA was retransformed into E. coJi strain JM101 cells for 
protein expression. The protein expressed is the same as 
that expressed from pMON3459. 



35 
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Example 21 
rnnstruetion of PMON25183 

5 pMON25183 was constructed using an overlapping PCR 

primer method. The synthetic oligomers, 132for.seq (SEQ ID 
NO:321 and 132rev.seq (SEQ ID NO:22), encode the Ncol and 
Nhel restriction recognition sequence, respectively. 
Amplified DNA was generated by the DNA polymerase chain 

10 amplification method using the PCR Optimizer Kit 

(Invitrogen) . The PCR reactions were performed using the 
manufacturer's recommended conditions using 5X buffer B 
(300mM Tris-HCl pH8.5, 75 mM (NH 4 )2S0 4 , 10mM MgCl2> for 
seven cycles consisting of 94°C for 1', 65°C for 2', and 

15 72°C for 2', followed by 20 cycles of 94°C for 1', and 72°C 
for 3', and a final cycle of 7 minutes at 72°C using a 
Perkin Elmer Model 480 DNA thermal cycler (Perkin Elmer) . 
The reaction product was desalted using Centri-Sep spin 
columns (Princeton Separations) following the manufacturer's 

20 recommended protocol, digested with Ncol/Nhel, and gel 

purified from TAE-agarose gels using Gene Clean (Bio 101) 
and the DNA product was eluted in dH20 The purified PCR 
product was ligated with the ~ 4090 base pair Ncol/Nhel 
pMON3453 vector fragment. Positive clones containing the 

25 AT-rich replacement insert were identified as described in 

Example 21. The resulting plasmid was designated pMON25183. 
The resulting gene, had 14 codon changes at the 5' end of 
the gene. Plasmid, pMON25183 containing the gene sequence 
of (SEQ ID NO:37), DNA was retransf ormed into E. coli strain 

30 JM101 cells for protein expression. The protein expressed is 
the same as that expressed from pMON3453. 

Example 24 

3 5 Construction of PMQN25187 
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PMON25187 was constructed using an overlapping PCR 
primer method. The synthetic oligomers, 132for.seq (SEQ ID 
N0:21) and 132rev.seq (SEQ ID NO:22), encode the Ncol and 
5 Nhel restriction recognition sequence, respectively. 

Amplified DNA was generated by the DNA polymerase chain 
amplification method using the PCR Optimizer Kit 
(Invitrogen) . The PCR reactions were performed using the 
manufacturer's recommended conditions, in 5X buffer B for 

10 seven cycles consisting of 94°C for 1', 65°C for 2', and 

72°C for 2', followed by 20 cycles of 94°C for 1', and 72°C 
for 3', and a final cycle of 7 minutes at 72°C using a 
Perkin Elmer Model 480 DNA thermal cycler (Perkin Elmer) . 
The reaction product was desalted using Centri-Sep spin 

15 columns (Pvinceton Separations) following the manufacturer's 
recommended protocol, digested with Ncol /Nhel, and gel 
purified from TAE-agarose gels using Gene Clean (Bio 101) 
and the DNA product was eluted in dH20. The purified PCR 
product was ligated with the - 4080 base pair Ncol/Nhel 

20 pMON3458 vector fragment. Positive clones containing the 
AT-rich replacement insert were identified as described in 
Example 21. The resulting plasmid was designated pMON25187. 
The resulting gene, had 14 codon changes at the 5' end of 
the gene. Plasmid, pMON25187 containing the gene sequence 

25 of (SEQ ID NO:41), DNA was retransf ormed into E. coli strain 
JM101 cells for protein expression. The protein expressed is 
the same as that expressed from pMON3458. 



30 



Example 2 5 

Construction p£ PMQN2518? 



35 



PMON25182 was constructed using the overlapping PCR 
primer approach described in Example 23. The synthetic 
oligomer primers 125for.seq (SEQ ID NO: 19) and 125rev.seq 
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(SEQ ID NO: 20) were used in the PCR reaction. The PCR 
reaction conditions were identical to those used in Example 
23 except the annealing temperature for the first seven 
cycles was 60°C. The purified PCR product was ligated with 
5 - 4070 base pair Ncol/Nhel pMON3452 vector fragment. 

Positive clones containing the AT-rich replacement insert 
were identified as described in Example 21. The resulting 
plasmid was designated pMON25182. The resulting gene, had 
19 codon changes at the 5* end of the gene. Plasmid, 
10 pMON25182 containing the gene sequence of (SEQ ID NO:36), 

DNA was retransf ormed into E. coli strain JM101 cells for 
protein expression. The protein expressed is the same as 
that expressed from pMON3452. 

15 Example 26 

Construction of PMON25186 

pMON25186 was constructed using the overlapping PCR 
20 primer approach described in Example 23. The synthetic 

oligomer primers 125for.seq (SEQ ID NO: 19) and 125rev.seq 
(SEQ ID NO: 20) were used in the PCR reaction. The PCR 
reaction conditions were identical to those used in Example 
23 except the annealing temperature for the first seven 
25 cycles was 60°C. The purified PCR product was ligated with 
the - 4060 base pair Ncol/Nhel pMON3457 vector fragment. 
Positive clones containing the AT-rich replacement insert 
were identified as described in Example 21. The resulting 
plasmid was designated pMON25186. The resulting gene, had 
30 19 codon changes at the 5* end of the gene. Plasmid, 

pMON25186 containing the gene sequence of (SEQ ID NO:40), 
DNA was retransf ormed into E. coli strain JM101 cells for 
protein expression. The protein expressed is the same as 
that expressed from pMON3457. 
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Construction of nMDM^Riai 

5 pMON25181 was constructed using PCR to amplify a DNA 

fragment from pMON3451 as the template using the oligomers 
96for.seq (SEQ ID NO:17) and 96rev.seq (SEQ ID N0:18). The 
oligomer 96for.seq was designed to create six codon changes. 
The PCR reaction conditions were the same as described in 

10 Example 25, except lOng of pMON3451 plasmid DNA was added. 
The purified PCR product was ligated with the - 3980 base 
pair Ncol/Nhel pMON3451 vector fragment. Positive clones 
containing the AT-rich replacement insert were identified as 
described in Example 21. The resulting plasmid was 

15 designated OMON25181. The resulting gene, had 6 codon 
changes at the 5' end of the gene. Plasmid, pMON25181 
containing the gene sequence of (SEQ ID NO-.35), DNA was 
retransformed into E. coli strain JM101 cells for protein 
expression. The protein expressed is the same as that 

20 expressed from pMON3451. 

Construction of nMON^IfiS 

25 

PMON25185 was constructed using PCR to amplify a DNA 
fragment from pMON3451 as the template using the oligomers 
96for.seq (SEQ ID NO:17) and 96rev.seq (SEQ ID NO:18). The 
oligomer 9697for.seq was designed to create six codon 

30 changes. Tie PCR reaction conditions were the same as 

described in Example 25, except lOng of pMON3456 plasmid DNA 
was added. The purified PCR product was ligated with the - 
3970 base pair Ncol/Nhel pMON3456 vector fragment. Positive 
clones containing the AT-rich replacement insert were 

35 identified as described in Example 21. The resulting 
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plasmid was designated pMON25185. The resulting gene, had 6 
codon changes at the 5* end of the gene. Plasmid, pMON25185 
containing the gene sequence of (SEQ ID NO: 39), DNA was 
retransf ormed into £. coli strain JM101 cells for protein 
5 expression. The protein expressed is the same as that 
expressed from pMON3456. 

EXAMPLE 29 

10 The G-CSF amino acid substitution variants of the 

present invention were made using PCR mutagenesis techniques 
as described in WO 94/12639 and WO 94/12638. These and other 
variants (i.e. amino acid substitutions, insertions or 
deletions and N- terminal or C-terminal extensions) could 

15 also be made, by one skilled in the art, using a variety of 
other methods including synthetic gene assembly or site- 
directed mutagenesis (see Taylor et al . , Nucl . Acids Res., 
13:7864-8785, 1985; Kunkel et al., Proc. Natl. Acad. Sci . 
USA, 82:488-492, 1985; Sambrook et al., Molecular Cloning: A 

20 Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989, WO 94/12639 and WO 
94/12638) . These substitutions can be made one at a time or 
in combination with other amino acid substitutions, and/or 
deletions, and/or insertions and/or extensions. After 

25 sequence verification of the changes, the plasmid DNA can be 
transfected into an appropriate mammalian cell, insect cell 
or bacterial strain such as E. coli for production. Known 
variants of G-CSF, which are active, include substitutions 
at positions 1 (Thr to Ser, Arg or Gly, 2 (Pro to Leu), 3 

30 (Leu to Arg or Ser) and 17 (Cys to Ser) and deletions of 

amino acids 1-11 (Kuga et al . Biochemicla and Biophysical 
Research Comm. 159:103-111, 1989) . It is understood that 
these G-CSF amino acid substitution variants could serve as 
the template sequence for the rearrangement of the amino 

35 acid sequence as described in the other examples. 
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Bioactivity determination of G-CSF amino acid substitution 
variants . 

The G-CSF amino acid substitution variants were assayed 
in the Baf/3 cell line, transfected with the human G-CSF 
receptor, proliferation assay to determine their bioactivity 
relative to native G-CSF, The G-CSF variants tested and 
their relative bioactivity are shown in Table 5. A *+" 
indicates that the activity was comparable to native G-CSF 
and indicates that the activity was significantly 
decreased or not detected. 
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TABLE 5 

CELL PROLIFERATION ACTIVITY OF G-CSF VARIANTS IN BAF/1 CEL T. 
LINE TRANSFECTED WTTH THE HUMAN Ci-CSF RECEPTO R 



aa positiff 


native aa 


mutant aa 


activitv * 


13 


Phe 


Ser 




13 


Phe 


His 




13 


Phe 


Thr 


♦ 


13 


Phe 


Pro 


+ 


16 


Lys 


Pro 


♦ 


16 


Lys 


Ser 




16 


Lys 


Thr 


+ 


16 


Lys 


His 




18 


Leu 


Pro 


+ 


18 


Leu 


Thr 




18 


Leu 


His 




18 


Leu 


Cys 




18 


Leu 


He 


4 


19 


Glu 


Ala 


- 


19 


Glu 


Thr 


_ 


19 


Glu 


Arg 


_ 


19 


Glu 


Pro 




19 


Glu 


Leu 




19 


Glu 


Gly 




19 


Glu 


Ser 




22 


Arg 


Tyr 


♦ 


22 


Arg 


Ser 




22 


Arg 


Ala 


♦ 


22 


Arg 


Val 


+ 


22 


Arg 


Thr 




24 


He 


Pro 




24 


He 


Leu 




24 


He 


Tyr 
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TABLE 5 cont. 



aa position 


native aa 


mutant aa 


activity * 


27 


Asp 


Gly 


♦ 


30 


Ala 


He 




30 


Ala 


Leu 


♦ 


34 


Lys 


Ser 




43 


His 


Gly 


+ 


43 


His 


Thr 




43 


His 


Val 




1 43 


His 


Lys 


+ 


43 


His 


Trp 


+ 


| 43 


His 


Ala 


♦ 


43 


His 


Arg 




43 


His 


Cys 


♦ 


43 


His 


Leu 


+ 


44 


Pro 


Arg 


+ 


44 


Pro 


Asp 




44 


Pro 


Val 


♦ ! 


44 


Pro 


Ala 


♦ 


44 


Pro 


His 




44 


Pro 


Gin 




44 


Pro 


Trp 




44 


Pro 


Gly 




44 


Pro 


Thr 






Glu 


Ala 


+ 




Glu 


Arg 




46 


Glu 


Phe 


♦ 


46 


Glu 


He 




47 


Leu 


Thr 




49 


Leu 


Phe 




49 


Leu 


Arg 


♦ 


49 


Leu 


Ser 
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TABLE 5 cont . 



da position 


T"» A t" 1 VO A A 
UatlVC CL CI 


mi t f* A n t" A A 


A£*fc i vi f V * 


50 


Leu 


HIS 


+ 


50 


Leu 


Pro 


+ 


51 


Gly 


Ser 




51 


Gly 


Met 


♦ 


54 


Leu 


His 




67 


Gin 


Lys 




67 


Gin 


Leu 




67 


Gin 


Cys 




67 


Gin 


Lys 




70 


Gin 


Pro 




70 


Gin 


T All 

Leu 




70 


Gin 


ATy 


+ 


70 


Gin 


ser 


+ 


104 


Asp 


Gly 




104 


Asp 


Val 




108 


Leu 


Ala 




108 


Leu 


Val 




108 


Leu 


Arg 




108 


Leu 


Gly 




108 


Leu 


Trp 


+ 


108 


Leu 


Gin 




115 


Thr 


His 


♦ 


115 


Thr 


Leu 




115 


Thr 


Ala 




115 


Thr 


He 




120 


Gin 


Gly 




120 


Gin 


Arg 




120 


Gin 


Lys 




120 


Gin 


His 
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TABLE 5 cont. 



aa position 


native aa 


mutant aa 


activity * 


12 J 


Glu 


Arg 




123 


Glu 


Phe 




12 J 


Glu 


Thr 


♦ 


144 


Phe 


His 


♦ 


144 


Phe 


Arg 


+ 


i i i 

144 


Phe 


Pro 




144 


Phe 


Leu 




i ji it 
144 


Phe 


Glu 


♦ 


146 


Arg 


Gin 




147 


Arg 


Gin 




156 


His 


Asp 




156 


His 


Ser 




156 


His 


Gly 




159 


Ser 


Arg 




159 


Ser 


Thr 




159 


Ser 


Tyr 




159 


Ser 


Val 


♦ 


159 


Ser 


Gly 


♦ 


162 


Glu 


Gly 




162 


Glu 


Trp 


+ 


162 


Glu 


Leu 




163 


Val 


Arg 


♦ 


163 


Val 


Ala 


♦ 


163 


Val 


Gly 


♦ 


165 


Tyr 


Cys 


not determined 


169 


Ser 


Leu 




169 


Ser 


Cys 


+ 


169 


Ser 


Arg 




170 


His 


Arg 


♦ 


170 


His 


Ser 
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EXAMLPLE 10-17 



Examples 30-37 were made in a similar manner as 
described in Example 6 using the plasmid pMON13037 as the 
template and the oligonucleotide primers indicated in Table 
6. The resulting gene and the designated plasmid pMON # and 
the protein encoded are indicated in Table 6. 
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TABLE 6 



Example 


breakpoint 


primers 


resulting gene 


resulting 
jprotein 


30 


48/49 


Lai U 

(SEQ ID NO: 68) 

4 8s top 
(SEQ ID NO: 69) 


PMON3460 
(SEQ ID NO: 86) 


(SEQ ID NO: 95) 




76/77 


/ /start 
(SEQ ID NO:70) 

76stop 
(SEQ ID NO:71) 


PMON3461 
(SEQ ID NO: 87) 


(SEQ ID NO:96) 


j t* 


ftl /A9 


82start 
(SEQ ID NO: 72) 

81stop 
(SEQ ID NO:73) 


PMON3462 
(SEQ ID NO:88) 


(SEQ ID NO: 97) 






84start 

(SEQ ID NO: 74) 

83stop 
(SEQ ID NO:75) 


PMON3463 
(SEQ ID NO: 88) 


(SEQ ID NO:98) 






91start 
(SEQ ID NO:76) 

90stop 
(SEQ ID NO:77) 


PMON3464 
(SEQ ID NO: 89) 


(SEQ ID NO: 99) 


35 


111/112 


112start 

• A mm 0 w U K \» 

(SEQ ID NO:78) 

lllstop 
(SEQ ID NO:79) 


ywwii J t O w 

(SEQ ID NO:90) 


(SEQ ID NO: 100) 


36 


116/117 


117start 
(SEQ ID NO: 80) 

116stop 
(SEQ ID NO: 81) 


PMON3466 
(SEQ ID NO:91) 


(SEQ ID NO: 101) 


37 


118/119 


119start 
(SEQ ID NO: 62) 

118stop 
(SEQ ID NO: 83) 


PMON3467 
(SEQ ID NO: 92) 


(SEQ ID NO: 102) 



The G-CSF receptor agonist genes in pMON3640, pMON3461, 
PMON3462, pMON3463, pMON3464, pMON3465, pMON3466 and 
pMON3467 were transferred to an E. coli expression vector, 
PMON2341, as an Ncol/Hindlll restriction fragment, resulting 
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in the plasmids pMON3468, pMON3469, pMON3470, pMON3471, 
pMON3472, pMON3473, pMON3474 and pMON3498 respectively. 

EXAMPLE 38 

5 

The plasmid, pMON3468, resulted in low expression 
levels in E. coli of the desired G-CSF receptor agonist. The 
5' end of the gene was redesigned to use codon selection 
that was AT rich to increase expression levels. The 
10 oligonucleotides, Z4849AT.for (SEQ ID NO:84) and Z4849AT.rev 

(SEQ ID NO:85), were used to re-engineer the gene. The 
resulting plasmid, pMON3499, containing the gene (SEQ ID 
NO: 94) encodes the G-CSF receptor agonist of (SEQ ID 
NO:103) . 

15 

EXAMPLE 39 

The G-CSF receptor agonists were assayed in the Baf/3 
cell line, transfected with the human G-CSF receptor, 
20 (Baf /3-G-CSF) proliferation assay to determine their 

bioactivity relative to native G-CSF. The activity of the 
receptor agonists is shown in Table 7. 
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TABLE 7 

G-CSF receptor agonist activity in Baf/3-G-CSF cell 
proliferation assay 



pMON# 


breakpoint 


Expression 


E. coli 


EC50 <pM) 


native G-CSF 








60 pM 


PMON25182 


125/126 






38 pM 


PMON25183 


132/133 


♦ 


+ 


58 pM 


PMON25184 


141/142 






70 pM 


PMON25186 


125/126 






92 pM 


PMON25187 


132/133 


+ 


♦ 


83 pM 


nMON251 Aft 


Ill / A » 




+ 


4 1 pM 


pMON3450 


38/39 






121 pM 


PMON3455 


38/39 


+ 


+ 


102 pM 


PMON3499 


48/49 




♦ 


137 pM 


pMON3470 


81/82 


♦ 


♦ 


no 

activity 
detected 


pMON3473 


111/112 


♦ 
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Additional techniques for the construction of the 
variant genes, recombinant protein expression , protein 
purification, protein characterization, biological activity 
determination can be found in WO 94/12639, WO 94/12638, WO 
95/20976, WO 95/21197, WO 95/20977, WO 95/21254 which are 
hereby incorporated by reference in their entirety. 

All references, patents or applications cited herein 
are incorporated by reference in their entirety as if 
written herein. 

Various other examples will be apparent to the person 
skilled in the art after reading the present disclosure 
without departing from the spirit and scope of the 
invention. It is intended that all such other examples be 
included within the scope of the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: G. D. Searle & Co. 

(B) STREET: P. 0. Box 5110 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) : 60680 
<G) TELEPHONE: (708)470-6501 
(H) TELEFAX: (708)470-6881 

(A) NAME: Monsanto Company 

(B) STREET: 800 North Lindbergh Boulevard 

(C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) : 63167 

(G) TELEPHONE: (314)647*3131 

(H) TELEFAX: (314)694-5435 

(ii) TITLE OF INVENTION: G-CSF Receptor Agonists 
(iii) NUMBER OF SEQUENCES: 103 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US 2907 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/004,382 

(B) FILING DATE: 05-OCT-1995 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 



WO 97/12977 



PCT/US96/15935 



97 



(B) LOCATION: 1 

(D) OTHER INFORMATION :/note= "Xaa at position 1 is Thr, 
Ser, Arg, Tyr or Gly; " 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATIONS 

(D) OTHER INFORMATION:/ no te= "Xaa at position 2 is Pro or 
Leu; " 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION : 3 

(D) OTHER INFORMATION: /note* "Xaa at position 3 is Leu, 
Arg, Tyr or Ser; ■ 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note* "Xaa at position 13 is Phe, 
Ser, His, Thr or Pro;" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note* "Xaa at position 16 is Lys, 
Pro, Ser, thr or His;" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /note* "Xaa at position 17 is Cys, 
Ser, Gly, Ala, lie, Tyr or Arg;" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /note* "Xaa at position 18 is Leu, 
Thr, Pro, His, lie or Cys;" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "Xaa at position 22 is Arg, 
Tyr, Ser, Thr or Ala; ■ 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 24 

(D) OTHER INFORMATION: /n te= "Xaa at position 24 is He, 
Pro, Tyr or Leu; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
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(B) LOCATION: 27 

(D) OTHER INFORMATION :/note= "Xaa at position 27 is Asp, 
or Gly; " 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 30 

(D) OTHER INFORMATION: /note= "Xaa at position 30 is Ala, 
lie, Leu or Gly; ■ 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 34 

(D) OTHER INFORMATION: /note* "Xaa at position 34 is Lys 
or Ser ; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 36 

(D) OTHER INFORMATION: /note= "Xaa at position 36 is Cys 
or Ser; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 42 

(D) OTHER INFORMATION :/note= "Xaa at position 42 is Cys 
or Ser; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 43 

(D) OTHER INFORMATION: /note- "Xaa at position 43 is His, 
Thr, Gly, Val, Lys, Trp, Ala, Arg, Cys, or Leu; " 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 44 

(D) OTHER INFORMATION: /note- "Xaa at position 44 is Pro, 
Gly, Arg, Asp, Val, Ala, His, Trp, Gin, or Thr;" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 46 

(D) OTHER INFORMATION: /note- "Xaa at position 46 is Glu, 
Arg, Phe, Arg, He or Ala; ■ 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 47 

(D) OTHER INFORMATION: /note- "Xaa at position 47 is Leu 
or Thr ; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
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( B) LOCATION: 49 

(D) OTHER INFORMATION: /note= «Xaa at position 49 is Leu, 
Phe , Arg or Ser ; " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 50 

(D) OTHER INFORMATION: /note= "Xaa at position 50 is Leu, 
lie, His, Pro or Tyr; " 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 54 

(D) OTHER INFORMATION: /note* -Xaa at position 54 is Leu 
or His;" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 64 

(D) OTHER INFORMATION: /note= "Xaa at position 64 is Cys 
or Ser; ■ 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 67 

(D) OTHER INFORMATION: /note« "Xaa at position 67 is Gin, 
Lys , Leu or Cys ; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 70 

(D) OTHER INFORMATION: /notes «Xaa at position 70 is Gin, 
Pro, Leu, Arg or Ser;" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
<B) LOCATION: 74 

(D) OTHER INFORMATION: /notes "Xaa at position 74 is Cys 
or Ser;" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 104 

(D) OTHER INFORMATION: /note* "Xaa at position 104 is Asp, 
Gly or Val;" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 108 

(D) OTHER INFORMATION :/n tes "Xaa at position 108 is Leu, 
Ala, Val. Arg, Trp, Gin r Gly;" 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 
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(B) LOCATION: 115 

(D) OTHER INFORMATION :/note= "Xaa at position 115 is Thr, 
His, Leu or Ala; ■ 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 120 

(D) OTHER INFORMATION: /note= "Xaa at position 120 is Gin, 
Gly, Arg, Lys or His" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 123 

(D) OTHER INFORMATION: /notea "Xaa at position 123 is Glu, 
Arg, Phe or Thr" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 144 

(D) OTHER INFORMATION:/ no te« "Xaa at position 144 is Phe, 
His, Arg, Pro, Leu, Gin or Glu;" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 146 

(D) OTHER INFORMATION: /note* "Xaa at positionl46 is Arg 
or Gin;" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 147 

(D) OTHER INFORMATION: /note* "Xaa ap position 147 is Arg 
or Gin; " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 156 

(D) OTHER INFORMATION: /note- "Xaa at position 156 is His, 
Gly or Ser; " 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 159 

(D) OTHER INFORMATION: /note- "Xaa at position 159 is Ser, 
Arg, Thr, Tyr, Val or Gly;" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 162 

(D) OTHER INFORMATION :/n te» "Xaa at position 162 is Glu, 
Leu, Gly or Trp; " 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 
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(B) LOCATION: 163 

(D) OTHER INFORMATION: /note= "Xaa at position 163 is Val, 
Gly, Arg or Ala; " 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 169 

(D) OTHER INFORMATION :/note= "Xaa at position 169 is Arg, 
Ser, Leu, Arg or Cys;" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 170 

(D) OTHER INFORMATION :/note* "Xaa at position 170 is His, 
Arg or Ser; ■ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Xaa Xaa Xaa Gly Pro Ala Ser Ser Leu Pro Gin Ser Xaa Leu Leu Xaa 
15 10 15 

Xaa Xaa Glu Gin Val Xaa Lys Xaa Gin Gly Xaa Gly Ala Xaa Leu Gin 
20 25 30 

Glu Xaa Leu Xaa Ala Thr Tyr Lys Leu Xaa Xaa Xaa Glu Xaa Xaa Val 
35 40 45 

Xaa Xaa Gly His Ser Xaa Gly He Pro Trp Ala Pro Leu Ser Ser Xaa 
50 55 60 

Pro Ser Xaa Ala Leu Xaa Leu Ala Gly Xaa Leu Ser Gin Leu His Ser 
65 70 75 80 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
85 90 95 

Pro Glu Leu Gly Pro Thr Leu Xaa Thr Leu Gin Xaa Asp Val Ala Asp 
100 105 110 

Phe Ala Xaa Thr He Trp Gin Gin Met Glu Xaa Xaa Gly Met Ala Pro 
115 120 125 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Xaa 
130 135 140 

Gin Xaa Xaa Ala Gly Gly Val Leu Val Ala Ser Xaa Leu Gin Xaa Phe 
145 150 155 160 

Leu Xaa Xaa Ser Tyr Arg Val Leu Xaa Xaa Leu Ala Gin Pr 
165 170 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Gly Gly Gly Ser 
1 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCTCTGAGAG CCGCCAGAGC CGCCAGAGGG CTGCGCAAGG TGGCGTAGAA CGCG 
54 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "DNA (synthetic)' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CAGCCCTCTG GCGGCTCTGG CGGCTCTCAG AGCTTCCTGC TCAAGTCTTT AGAG 
54 
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(2). INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGGCTGCGCA AGGTGGCG 
18 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACACCATTGG GCCCTGCCAG C 
21 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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GATCGACCAT GGCTTACAAG CTGTGCCACC CC 
32 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CGATCGAAGC TTATTAGGTG GCACACAGCT TCTCCT 
36 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCGACCAT GGCTCCCGAG TTGGGTCCCA CC 
32 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "DNA (synthetic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CGATCGAAGC TTATTAGGAT ATCCCTTCCA GGGCCT 
36 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GATCGACCAT GGCTATGGCC CCTGCCCTGC AG 
32 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGATCGAAGC TTATTATCCC AGTTCTTCCA TCTGCT 
36 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ther nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)* 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 13: 

GATCGACCAT GGCTACCCAG GGTGCCATGC CG 
32 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = - DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CGATCGAAGC TTATTAGGGC TGCAGGGCAG GGGCCA 
36 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GATCGACCAT GGCTTCTGCT TTCCAGCGCC GG 
32 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CGATCGAAGC TTATTAGGCG AAGGCCGGCA TGGCAC 
36 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATATCCATGG CTCCGGAACT GGGTCCAACT CTG 
33 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ACCTCCAGGA AGCTCTGCAG ATGG 
24 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA ( synthetic) • 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TATATCCATG GCTATGGCTC CAGCTCTGCA ACCAACTCAA GGTGCAATGC CAGCATTTGC 
60 

ATCTG 
65 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GATGGCTAGC AACCAGAACA CCACCTGCAC GACGTTGAAA AGCAGATGCA AATGCTGGCA 
60 

TTG 
63 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " DNA (synthetic)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

TATATCCATG GCTACTCAAG GTGCTATGCC AGCTTTTGCT TCTGCTTTTC AACGTCG 
57 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCAGATGGCT AGCAACCAGA ACACCACCTG CACGACGTTG AAAAGCAGAA GCAAAAGC 
58 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CATGGCTTCT GCTTTTCAAC GTCGTGCAGG TGGTGTTCTG GTTG 
44 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ther nucleic acid 

(A) DESCRIPTION: /desc ■ "DNA (synthetic)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CTAGCAACCA GAACACCACC TGCACGACGT TGAAAAGCAG AAGC 
44 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATGGCTTACA AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 
60 



CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC 
120 



CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 
180 



CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT TGCCACCACC 
240 



ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC 
300 



ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT 
360 



CTGCAGAGCT TCCTGGAGGT GTCGTACCGC GTTCTACGCC ACCTTGCGCA GCCCTCTGGC 
420 



GGCTCTGGCG GCTCTCAGAG CTTCCTGCTC AAGTCTTTAG AGCAAGTGAG GAAGATCCAG 
480 



GGCGATGGCG CAGCGCTCCA GGAGAAGCTG TGTGCCACCT AATAA 
525 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc * "DNA (synthetic) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATGGCTCCCG AGTTGGGTCC CACCTTGGAC ACACTGCAGC TGGACGTCGC CGACTTTGCC 
60 

ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
120 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
180 

AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
240 

TCTGGCGGCT CTGGCGGCTC TCAGAGCTTC CTGCTCAAGT CTTTAGAGCA AGTGAGGAAG 
300 

ATCCAGGGCG ATGGCGCAGC GCTCCAGGAG AAGCTGTGTG CCACCTACAA GCTGTGCCAC 
360 

CCCGAGGAGC TGGTGCTGCT CGGACACTCT CTGGGCATCC CCTGGGCTCC CCTGAGCTCC 
420 

TGCCCCAGCC AGGCCCTGCA GCTGGCAGGC TGCTTGAGCC AACTCCATAG CGGCCTTTTC 
480 

CTCTACCAGG GGCTCCTGCA GGCCCTGGAA GGGATATCCT AATAA 
525 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "DNA (synthetic)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATGGCTATGG CCCCTGCCCT GCAGCCCACC CAGGGTGCCA TGCCGGCCTT CGCCTCTGCT 
60 

TTCCAGCGCC GGGCAGGAGG GGTCCTGGTT GCTAGCCATC TGCAGAGCTT CCTGGAGGTG 
120 

TCGTACCGCG TTCTACGCCA CCTTGCGCAG CCCTCTGGCG GCTCTGGCGG CTCTCAGAGC 
180 

TTCCTGCTCA AGTCTTTAGA GCAAGTGAGG AAGATCCAGG GCGATGGCGC AGCGCTCCAG 
240 

GAGAAGCTGT GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGCT GCTCGGACAC 
300 

TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT GCAGCTGGCA 
360 

GGCTGCTTGA GCCAACTCCA TAGCGGCCTT TTCCTCTACC AGGGGCTCCT GCAGGCCCTG 
420 

GAAGGGATAT CCCCCGAGTT GGGTCCCACC TTGGACACAC TGCAGCTGGA CGTCGCCGAC 
480 

TTTGCCACCA CCATCTGGCA GCAGATGGAA GAACTGGGAT AATAA 
525 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "DNA (synthetic)- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATGGCTACCC AGGGTGCCAT GCCGGCCTTC GCCTCTGCTT TCCAGCGCCG GGCAGGAGGG 
60 

GTCCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT CGTACCGCGT TCTACGCCAC 
120 

CTTGCGCAGC CCTCTGGCGG CTCTGGCGGC TCTCAGAGCT TCCTGCTCAA GTCTTTAGAG 
180 
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CAAGTGAGGA AGATCCAGGG CGATGGCGCA GCGCTCCAGG AGAAGCTGTG TGCCACCTAC 
240 

AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG CTCGGACACT CTCTGGGCAT CCCCTGGGCT 
300 

CCCCTGAGCT CCTGCCCCAG CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG CCAACTCCAT 
360 

AGCGGCCTTT TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 
420 

GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC CATCTGGCAG 
480 

CAGATGGAAG AACTGGGAAT GGCCCCTGCC CTGCAGCCCT AATAA 
525 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc * "DNA (synthetic) - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGGCTTCTG CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA TCTGCAGAGC 
60 

TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCTCTGG CGGCTCTGGC 
120 

GGCTCTCAGA GCTTCCTGCT CAAGTCTTTA GAGCAAGTGA GGAAGATCCA GGGCGATGGC 
180 

GCAGCGCTCC AGGAGAAGCT GTGTGCCACC TACAAGCTGT GCCACCCCGA GGAGCTGGTG 
240 

CTGCTCGGAC ACTCTCTGGG CATCCCCTGG GCTCCCCTGA GCTCCTGCCC CAGCCAGGCC 
300 

CTGCAGCTGG CAGGCTGCTT GAGCCAACTC CATAGCGGCC TTTTCCTCTA CCAGGGGCTC 
360 

CTGCAGGCCC TGGAAGGGAT ATCCCCCGAG TTGGGTCCCA CCTTGGACAC ACTGCAGCTG 
420 
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GACGTCGCCG ACTTTGCCAC CACCATCTGG CAGCAGATGG AAGAACTGGG AATGGCCCCT 
480 



GCCCTGCAGC CCACCCAGGG TGCCATGCCG GCCTTCGCCT AATAA 
525 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATGGCTTACA AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 
60 



CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC 
120 

CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 
180 



CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT TGCCACCACC 
240 



ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC 
300 



ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT 
360 



CTGCAGAGCT TCCTGGAGGT GTCGTACCGC GTTCTACGCC ACCTTGCGCA GCCCACACCA 
420 



TTGGGCCCTG CCAGCTCCCT GCCCCAGAGC TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA 
480 



AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT GTGCCACCTA ATAA 
534 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGGCTCCCG AGTTGGGTCC CACCTTGGAC ACACTGCAGC TGGACGTCGC CGACTTTGCC 
60 

ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
120 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
180 

AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
240 

ACACCATTGG GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 
300 

GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC CACCTACAAG 
360 

CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC TGGGCATCCC CTGGGCTCCC 
420 

CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG CTGGCAGGCT GCTTGAGCCA ACTCCATAGC 
480 

GGCCTTTTCC TCTACCAGGG GCTCCTGCAG GCCCTGGAAG GGATATCCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "DNA (Synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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ATGGCTATGG CCCCTGCCCT GCAGCCCACC CAGGGTGCCA TGCCGGCCTT CGCCTCTGCT 
60 

TTCCAGCGCC GGGCAGGAGG GGTCCTGGTT GCTAGCCATC TGCAGAGCTT CCTGGAGGTG 
120 

TCGTACCGCG TTCTACGCCA CCTTGCGCAG CCCACACCAT TGGGCCCTGC CAGCTCCCTG 
180 



CCCCAGAGCT TCCTGCTCAA GTCTTTAGAG CAAGTGAGAA AGATCCAGGG CGATGGCGCA 
240 

GCGCTCCAGG AGAAGCTGTG TGCCACCTAC AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG 
300 

CTCGGACACT CTCTGGGCAT CCCCTGGGCT CCCCTGAGCT CCTGCCCCAG CCAGGCCCTG 
360 

CAGCTGGCAG GCTGCTTGAG CCAACTCCAT AGCGGCCTTT TCCTCTACCA GGGGCTCCTG 
420 

CAGGCCCTGG AAGGGATATC CCCCGAGTTG GGTCCCACCT TGGACACACT GCAGCTGGAC 
480 

GTCGCCGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGATA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATGGCTACCC AGGGTGCCAT GCCGGCCTTC GCCTCTGCTT TCCAGCGCCG GGCAGGAGGG 
60 

GTCCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT CGTACCGCGT TCTACGCCAC 
120 

CTTGCGCAGC CCACACCATT GGGCCCTGCC AGCTCCCTGC CCCAGAGCTT CCTGCTCAAG 
180 
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TCTTTAGAGC AAGTGAGAAA GATCCAGGGC GATGGCGCAG CGCTCCAGGA GAAGCTGTGT 
240 

GCCACCTACA AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 
300 

CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC 
360 

CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 
420 

CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT TGCCACCACC 
480 

ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATGGCTTCTG CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA TCTGCAGAGC 
60 

TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCACACC ATTGGGCCCT 
120 

GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC AAGTCTTTAG AGCAAGTGAG AAAGATCCAG 
180 

GGCGATGGCG CAGCGCTCCA GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG 
240 

GAGCTGGTGC TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 
300 

AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT TTTCCTCTAC 
360 

CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT TGGGTCCCAC CTTGGACACA 
420 
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CTGCAGCTGG ACGTCGCCGA CTTTGCCACC ACCATCTGGC AGCAGATGGA AGAACTGGGA 
480 



ATGGCCCCTG CCCTGCAGCC CACCCAGGGT GCCATGCCGG CCTTCGCCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC CGACTTTGCC 
60 

ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
120 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
180 

AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
240 

ACACCATTGG GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 
300 

GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC CACCTACAAG 
360 

CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC TGGGCATCCC CTGGGCTCCC 
420 

CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG CTGGCAGGCT GCTTGAGCCA ACTCCATAGC 
480 

GGCCTTTTCC TCTACCAGGG GCTCCTGCAG GCCCTGGAAG GGATATCCTA A 
531 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATGGCTATGG CTCCAGCTCT GCAACCAACT CAAGGTGCAA TGCCAGCATT TGCATCTGCT 
60 

TTTCAACGTC GTGCAGGTGG TGTTCTGGTT GCTAGCCATC TGCAGAGCTT CCTGGAGGTG 
120 

TCGTACCGCG TTCTACGCCA CCTTGCGCAG CCCACACCAT TGGGCCCTGC CAGCTCCCTG 
180 

CCCCAGAGCT TCCTGCTCAA GTCTTTAGAG CAAGTGAGAA AGATCCAGGG CGATGGCGCA 
240 

GCGCTCCAGG AGAAGCTGTG TGCCACCTAC AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG 
300 

CTCGGACACT CTCTGGGCAT CCCCTGGGCT CCCCTGAGCT CCTGCCCCAG CCAGGCCCTG 
360 

CAGCTGGCAG GCTGCTTGAG CCAACTCCAT AGCGGCCTTT TCCTCTACCA GGGGCTCCTG 
420 

CAGGCCCTGG AAGGGATATC CCCCGAGTTG GGTCCCACCT TGGACACACT GCAGCTGGAC 
480 

GTCGCCGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGATA A 
531 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
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ATGGCTACTC AAGGTGCTAT GCCAGCTTTT GCTTCTGCTT TTCAACGTCG TGCAGGTGGT 



GTTCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT CGTACCGCGT TCTACGCCAC 
120 

CTTGCGCAGC CCACACCATT GGGCCCTGCC AGCTCCCTGC CCCAGAGCTT CCTGCTCAAG 
180 



TCTTTAGAGC AAGTGAGAAA GATCCAGGGC GATGGCGCAG CGCTCCAGGA GAAGCTGTGT 
240 



GCCACCTACA AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 
300 



CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC 
360 

CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC AGGCCCTGGA AGGGATATCC 
420 



CCCGAGTTGG GTCCCACCTT GGACACACTG CAGCTGGACG TCGCCGACTT TGCCACCACC 
480 



ATCTGGCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCTA A 
531 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATGGCTTCTG CTTTTCAACG TCGTGCAGGT GGTGTTCTGG TTGCTAGCCA TCTGCAGAGC 
60 



TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCACACC ATTGGGCCCT 
120 



GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC AAGTCTTTAG AGCAAGTGAG AAAGATCCAG 
180 
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GGCGATGGCG CAGCGCTCCA GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG 
240 

GAGCTGGTGC TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 
300 

AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT TTTCCTCTAC 
360 

CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT TGGGTCCCAC CTTGGACACA 
420 

CTGCAGCTGG ACGTCGCCGA CTTTGCCACC ACCATCTGGC AGCAGATGGA AGAACTGGGA 
480 

ATGGCCCCTG CCCTGCAGCC CACCCAGGGT GCCATGCCGG CCTTCGCCTA A 
531 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC CGACTTTGCC 
60 

ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
120 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
180 

AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
240 

TCTGGCGGCT CTGGCGGCTC TCAGAGCTTC CTGCTCAAGT CTTTAGAGCA AGTGAGAAAG 
300 

ATCCAGGGCG ATGGCGCAGC GCTCCAGGAG AAGCTGTGTG CCACCTACAA GCTGTGCCAC 
360 

CCCGAGGAGC TGGTGCTGCT CGGACACTCT CTGGGCATCC CCTGGGCTCC CCTGAGCTCC 
420 
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TGCCCCAGCC AGGCCCTGCA GCTGGCAGGC TGCTTGAGCC AACTCCATAG CGGCCTTTTC 
480 

CTCTACCAGG GGCTCCTGCA GGCCCTGGAA GGGATATCCT AA 
522 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

ATGGCTATGG CTCCAGCTCT GCAACCAACT CAAGGTGCAA TGCCAGCATT TGCATCTGCT 
60 

TTTCAACGTC GTGCAGGTGG TGTTCTGGTT GCTAGCCATC TGCAGAGCTT CCTGGAGGTG 
120 

TCGTACCGCG TTCTACGCCA CCTTGCGCAG CCCTCTGGCG GCTCTGGCGG CTCTCAGAGC 
180 

TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA AAGATCCAGG GCGATGGCGC AGCGCTCCAG 
240 

GAGAAGCTGT GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGCT GCTCGGACAC 
300 

TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT GCAGCTGGCA 
360 



GGCTGCTTGA GCCAACTCCA TAGCGGCCTT TTCCTCTACC AGGGGCTCCT GCAGGCCCTG 
420 



GAAGGGATAT CCCCCGAGTT GGGTCCCACC TTGGACACAC TGCAGCTGGA CGTCGCCGAC 
480 

TTTGCCACCA CCATCTGGCA GCAGATGGAA GAACTGGGAT AA 
522 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGGCTACTC AAGGTGCTAT GCCAGCTTTT GCTTCTGCTT TTCAACGTCG TGCAGGTGGT 
60 

GTTCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT CGTACCGCGT TCTACGCCAC 
120 

CTTGCGCAGC CCTCTGGCGG CTCTGGCGGC TCTCAGAGCT TCCTGCTCAA GTCTTTAGAG 
180 

CAAGTGAGAA AGATCCAGGG CGATGGCGCA GCGCTCCAGG AGAAGCTGTG TGCCACCTAC 
240 

AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG CTCGGACACT CTCTGGGCAT CCCCTGGGCT 
300 

CCCCTGAGCT CCTGCCCCAG CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG CCAACTCCAT 
360 

AGCGGCCTTT TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 
420 

GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC CATCTGGCAG 
480 

CAGATGGAAG AACTGGGAAT GGCCCCTGCC CTGCAGCCCT AA 
522 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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ATGGCTTCTG CTTTTCAACG TCGTGCAGGT GGTGTTCTGG TTGCTAGCCA TCTGCAGAGC 
60 

TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCTCTGG CGGCTCTGGC 
120 

GGCTCTCAGA GCTTCCTGCT CAAGTCTTTA GAGCAAGTGA GAAAGATCCA GGGCGATGGC 
1B0 

GCAGCGCTCC AGGAGAAGCT GTGTGCCACC TACAAGCTGT GCCACCCCGA GGAGCTGGTG 
240 

CTGCTCGGAC ACTCTCTGGG CATCCCCTGG GCTCCCCTGA GCTCCTGCCC CAGCCAGGCC 
300 

CTGCAGCTGG CAGGCTGCTT GAGCCAACTC CATAGCGGCC TTTTCCTCTA CCAGGGGCTC 
360 

CTGCAGGCCC TGGAAGGGAT ATCCCCCGAG TTGGGTCCCA CCTTGGACAC ACTGCAGCTG 
420 

GACGTCGCCG ACTTTGCCAC CACCATCTGG CAGCAGATGG AAGAACTGGG AATGGCCCCT 
480 

GCCCTGCAGC CCACCCAGGG TGCCATGCCG GCCTTCGCCT AA 
522 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Tyr Lys Leu Cya His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
15 10 15 

Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
20 25 30 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
35 40 45 

Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr 
50 55 60 
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Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
65 70 75 80 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
85 90 95 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
100 105 110 

Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
115 120 125 

Val Leu Arg His Leu Ala Gin Pro Ser Gly Gly Ser Gly Gly Ser Gin 
130 135 140 

Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp 
145 150 155 160 

Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
165 170 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
1 5 10 15 

Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
20 25 30 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
35 40 45 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
50 55 60 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Ser Gly 
65 70 75 80 

Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
85 90 95 
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Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
100 105 no 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
115 120 125 

Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 
130 135 140 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
145 150 155 160 

Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 45: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala 
15 10 15 



Gly Gly Val Leu Val Ala Ser His 
20 

Tyr Arg Val Leu Arg His Leu Ala 
35 40 

Ser Gin Ser Phe Leu Leu Lys Ser 
50 55 

Gly Asp Gly Ala Ala Leu Gin Glu 
65 70 

Cys His Pro Glu Glu Leu Val Leu 
85 

Trp Ala Pro Leu Ser Ser Cys Pro 
100 

Cys Leu Ser Gin Leu His Ser Gly 
115 120 



Leu Gin Ser Phe Leu Glu Val Ser 
25 30 

Gin Pro Ser Gly Gly Ser Gly Gly 
45 

Leu Glu Gin Val Arg Lys He Gin 
60 

Lys Leu Cys Ala Thr Tyr Lys Leu 
75 80 

Leu Gly His Ser Leu Gly He Pro 
90 95 

Ser Gin Ala Leu Gin Leu Ala Gly 
105 110 

Leu Phe Leu Tyr Gin Gly Leu Leu 
125 
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Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr 
130 135 140 

Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin Met 
145 150 155 160 

Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro 
165 170 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TGGAATAAAA AAGAGAGAAG GAAAAGGATA GAAGAAGGGG GGGGAAGGGA GAAAAGGCAA 
60 

TTCGGAGGTA ACGAAGAAGC GGTGGGAAGG GGTATGAAAA AAATTTGGTG GGTAAAAG 
118 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
15 10 15 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
20 25 30 

Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu 
35 40 45 
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Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
' 50 55 60 

Leu Cy9 Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu 
65 70 75 80 

Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
85 90 95 

Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu 
100 105 110 

Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu 
115 120 125 

Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala 
130 135 140 

Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 
145 150 155 160 

Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
165 170 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
15 10 15 

Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
20 25 30 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
35 40 45 

Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr 
50 55 60 



Leu Asp Thr 
65 



Leu 



Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
70 75 80 
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Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
85 90 95 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
100 105 110 

Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
115 120 125 

Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser 
130 135 140 

Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys lie 
145 150 155 160 

Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
15 10 15 

Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
20 25 30 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
35 40 45 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
50 55 60 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro 
65 70 75 80 

Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 



165 



170 



85 



90 



95 



Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
100 105 110 
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Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu 
115 120 ,125 

Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
130 135 140 

Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu 
145 150 155 160 

Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
15 10 15 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
20 25 30 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
35 40 45 

Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 
50 55 60 

Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu 
65 70 75 80 

Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu 
85 90 95 

Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser 
100 105 110 

Cys Pr Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His 
115 120 125 



Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie 
130 135 140 
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Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
145 150 155 160 

Asp Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly 
165 170 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala 
15 10 15 

Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser 
20 25 30 

Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala 
35 40 45 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
50 55 60 

Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
65 70 75 80 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
85 90 95 

Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
100 105 110 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
115 120 125 

Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr 
130 135 140 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
145 150 155 160 

Gin Gin Met Glu Glu Leu Gly Met Ala Pr Ala Leu Gin Pro 
165 170 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
15 10 15 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
20 25 30 

Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 
35 40 45 

Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu 
50 55 60 

Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu 
65 70 75 80 

Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser 
85 90 95 

Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His 
100 105 110 

Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He 
115 120 125 

Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
130 135 140 

Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala 
145 150 155 160 

Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
165 170 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amin acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
15 10 15 

Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
20 25 30 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
35 40 45 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
50 55 60 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro 
65 70 75 80 

Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 
85 90 95 

Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
100 105 110 

Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu 
115 120 125 

Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
130 135 140 

Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu 
145 150 155 160 

Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: pr tein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 

1.5 10 is 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
20 25 30 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
35 40 45 

Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 
50 55 60 

Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu 
65 70 75 80 

Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu 
85 90 95 

Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser 
100 105 110 

Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His 
115 120 125 

Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie 
130 135 140 

Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
145 150 155 160 

Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly 
165 170 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala 
15 10 15 
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Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser 
20 25 30 



Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala 
35 40 45 



Ser Ser Leu Pro Gin Ser Phe Leu 
50 55 

Lys He Gin Gly Asp Gly Ala Ala 
65 70 

Tyr Lys Leu Cys His Pro Glu Glu 
85 



Leu Lys Ser Leu Glu Gin Val Arg 
60 

Leu Gin Glu Lys Leu Cys Ala Thr 
75 80 

Leu Val Leu Leu Gly His Ser Leu 
90 95 



Gly He Pro Trp Ala Pro Leu Ser 
100 

Leu Ala Gly Cys Leu Ser Gin Leu 
115 120 

Gly Leu Leu Gin Ala Leu Glu Gly 
130 135 

Leu Asp Thr Leu Gin Leu Asp Val 
145 150 

Gin Gin Met Glu Glu Leu Gly Met 
165 

(2) INFORMATION FOR SEQ ID NO: 56: 



Ser Cys Pro Ser Gin Ala Leu Gin 
105 110 

His Ser Gly Leu Phe Leu Tyr Gin 
125 

He Ser Pro Glu Leu Gly Pro Thr 
140 

Ala Asp Phe Ala Thr Thr He Trp 
155 160 

Ala Pro Ala Leu Gin Pro 
170 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
15 10 15 

Gin Ser Ph Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 

20 25 . 30 

Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 
35 40 45 
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Lys Ser Leu Glu Gin Val Arg Lys 
50 55 

Gin Glu Lys Leu Cys Ala 
65 70 

Val Leu Leu Gly His Ser 
85 

Cys Pro Ser Gin Ala Leu Gin Leu 
100 

Ser Gly Leu Phe Leu Tyr Gin Gly 
115 120 

Ser Pro Glu Leu Gly Pro Thr Leu 
130 135 

Asp Phe Ala Thr Thr He 
145 150 

Pro Ala Leu Gin Pro Thr 
165 

(2) INFORMATION FOR SEQ ID NO: 57: 



He Gin Gly Asp Gly Ala Ala Leu 
60 



Ala Gly Cys Leu Ser Gin Leu His 
105 HO 

Leu Leu Gin Ala Leu Glu Gly He 
125 

Asp Thr Leu Gin Leu Asp Val Ala 
140 



Trp Gin Gin Met Glu Glu Leu Gly Met Ala 
155 160 

Gin Gly Ala Met Pro Ala Phe Ala 
170 



Thr Tyr Lys Leu Cys His Pro Glu Glu Leu 

75 80 

Leu Gly He Pro Trp Ala Pro Leu Ser Ser 
90 95 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
15 10 15 

Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
20 25 30 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
35 40 45 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
50 55 60 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Ser Gly 
65 70 75 80 
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Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
85 90 95 

Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
100 105 110 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
115 120 125 

Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 
130 135 140 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
145 150 155 160 

Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
15 10 15 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
20 25 30 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
35 40 45 

Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu 
50 * 55 60 

Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
65 70 75 80 

Ala Thr Tyr Lys Leu Cys His Pr Glu Glu Leu Val Leu L u Gly His 
85 90 95 

Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 
100 105 HO 
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Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu 
115 120 125 

Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly 
130 135 140 

Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr 
145 150 155 160 

He Trp Gin Gin Met Glu Glu Leu Gly 
165 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala 
15 10 15 

Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser 
20 25 30 

Tyr Arg Val Leu Arg His Leu Ala Gin Pro Ser Gly Gly Ser Gly Gly 
35 40 45 

Ser Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin 
50 55 60 

Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu 
65 70 75 80 

Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro 
85 90 95 

Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly 
100 105 110 

Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu 
115 120 125 



Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr 
130 135 140 
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Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin Met 
145 150 155 160 

Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro 
165 170 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu 
15 10 15 

Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
20 25 30 

Pro Ser Gly Gly Ser Gly Gly Ser Gin Ser Phe Leu Leu Lys Ser Leu 
35 40 45 

Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
50 55 60 

Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu 
65 70 75 80 

Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
85 90 95 

Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu 
100 105 110 

Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu 
115 120 125 

Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala 
130 135 140 

Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 
145 150 155 160 



Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
165 170 
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(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Ser Gly Gly Ser Gly Gly Ser 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Phe Gly Asn Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Glu Phe Gly Gly Asn Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Glu Phe Gly Gly Asn Gly Gly Asn Met 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Gly Gly Ser Asp Met Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc * "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GATCGACCAT GGCTCTGCTC GGACACTCTC TG 
32 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)' 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
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CGATCGAAGC TTATTACACC AGCTCCTCGG GGTGGC 
36 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

GATCGACCAT GGCTCAACTC CATAGCGGCC TT 
32 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) ■ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CGATCGAAGC TTATTAGCTC AAGCAGCCTG CCAGCT 
36 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GATCGACCAT GGCTCTTTTC CTCTACCAGG GG 
32 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc «= "DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGATCGAAGC TTATTAGCCG CTATGGAGTT GGCTCA 
36 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GATCGACCAT GGCTCTCTAC CAGGGGCTCC TG 
32 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = -DNA (synthetic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CGATCGAAGC TTATTAGAAA AGGCCGCTAT GGAGTT 
36 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GATCGACCAT GGCTGCCCTG GAAGGGATAT CC 
32 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

CGATCGAAGC TTATTACTGC AGGAGCCCCT GGTAGA 
36 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

GATCGACCAT GGCTGACTTT GCCACCACCA TC 
32 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

CGATCGAAGC TTATTAGGCG ACGTCCAGCT GCAGTG 
36 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GATCGACCAT GGCTATCTGG CAGCAGATGG AA 
32 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ocher nucleic acid 

(A) DESCRIPTION: /desc « " DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CGATCGAAGC TTATTAGGTG GTGGCAAAGT CGGCGA 
36 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

GATCGACCAT GGCTCAGCAG ATGGAAGAAC TG 
32 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CGATCGAAGC TTATTACCAG ATGGTGGTGG CAAAGT 
36 

(2) INFORMATION FOR SEQ ID NO: 84: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CATGGCTTTG TTAGGACATT CTTTAGGTAT TCCATGGGCT CCTCTGAGCT 
50 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA {synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

CAGAGGAGCC CATGGAATAC CTAAAGAATG TCCTAACAAA 
40 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
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ATGGCTCTGC TCGGACACTC TCTGGGCATC CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC 
60 

CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG 
120 

GGGCTCCTGC AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT GGACACACTG 
180 

CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC AGATGGAAGA ACTGGGAATG 
240 

GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC 
300 

CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT GTCGTACCGC 
360 

GTTCTACGCC ACCTTGCGCA GCCCACACCA TTGGGCCCTG CCAGCTCCCT GCCCCAGAGC 
420 

TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA AAGATCCAGG GCGATGGCGC AGCGCTCCAG 
480 

GAGAAGCTGT GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

ATGGCTCAAC TCCATAGCGG CCTTTTCCTC TACCAGGGGC TCCTGCAGGC CCTGGAAGGG 
60 

ATATCCCCCG AGTTGGGTCC CACCTTGGAC ACACTGCAGC TGGACGTCGC CGACTTTGCC 
120 

ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
180 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
240 
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AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
300 



ACACCATTGG GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 
360 



GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC CACCTACAAG 
420 



CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC TGGGCATCCC CTGGGCTCCC 
480 

CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG CTGGCAGGCT GCTTGAGCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

ATGGCTCTTT TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 
60 

GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC CATCTGGCAG 
120 

CAGATGGAAG AACTGGGAAT GGCCCCTGCC CTGCAGCCCA CCCAGGGTGC CATGCCGGCC 
180 

TTCGCCTCTG CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA TCTGCAGAGC 
240 

TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC AGCCCACACC ATTGGGCCCT 
300 

GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC AAGTCTTTAG AGCAAGTGAG AAAGATCCAG 
360 

GGCGATGGCG CAGCGCTCCA GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG 
420 
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GAGCTGGTGC TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 
480 

AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

ATGGCTCTCT ACCAGGGGCT CCTGCAGGCC CTGGAAGGGA TATCCCCCGA GTTGGGTCCC 
60 

ACCTTGGACA CACTGCAGCT GGACGTCGCC GACTTTGCCA CCACCATCTG GCAGCAGATG 
120 

GAAGAACTGG GAATGGCCCC TGCCCTGCAG CCCACCCAGG GTGCCATGCC GGCCTTCGCC 
180 

TCTGCTTTCC AGCGCCGGGC AGGAGGGGTC CTGGTTGCTA GCCATCTGCA GAGCTTCCTG 
240 

GAGGTGTCGT ACCGCGTTCT ACGCCACCTT GCGCAGCCCA CACCATTGGG CCCTGCCAGC 
300 

TCCCTGCCCC AGAGCTTCCT GCTCAAGTCT TTAGAGCAAG TGAGAAAGAT CCAGGGCGAT 
360 

GGCGCAGCGC TCCAGGAGAA GCTGTGTGCC ACCTACAAGC TGTGCCACCC CGAGGAGCTG 
420 

GTGCTGCTCG GACACTCTCT GGGCATCCCC TGGGCTCCCC TGAGCTCCTG CCCCAGCCAG 
480 

GCCCTGCAGC TGGCAGGCTG CTTGAGCCAA CTCCATAGCG GCCTTTTCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 90: 

ii) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

ATGGCTGCCC TGGAAGGOAT ATCCCCCGAG TTGGGTCCCA CCTTGGACAC ACTGCAGCTG 
60 



GACGTCGCCG ACTTTGCCAC CACCATCTGG CAGCAGATGG AAGAACTGGG AATGGCCCCT 
120 

GCCCTGCAGC CCACCCAGGG TGCCATGCCG GCCTTCGCCT CTGCTTTCCA GCGCCGGGCA 
180 



GGAGGGGTCC TGGTTGCTAG CCATCTGCAG AGCTTCCTGG AGGTGTCGTA CCGCGTTCTA 
240 

CGCCACCTTG CGCAGCCCAC ACCATTGGGC CCTGCCAGCT CCCTGCCCCA GAGCTTCCTG 
300 

CTCAAGTCTT TAGAGCAAGT GAGAAAGATC CAGGGCGATG GCGCAGCGCT CCAGGAGAAG 
360 



CTGTGTGCCA CCTACAAGCT GTGCCACCCC GAGGAGCTGG TGCTGCTCGG ACACTCTCTG 
420 

GGCATCCCCT GGGCTCCCCT GAGCTCCTGC CCCAGCCAGG CCCTGCAGCT GGCAGGCTGC 
480 

TTGAGCCAAC TCCATAGCGG CCTTTTCCTC TACCAGGGGC TCCTGCAGTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
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ATGGCTGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGAAT GGCCCCTGCC 
60 

CTGCAGCCCA CCCAGGGTGC CATGCCGGCC TTCGCCTCTG CTTTCCAGCG CCGGGCAGGA 
120 

GGGGTCCTGG TTGCTAGCCA TCTGCAGAGC TTCCTGGAGG TGTCGTACCG CGTTCTACGC 
180 

CACCTTGCGC AGCCCACACC ATTGGGCCCT GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC 
240 

AAGTCTTTAG AGCAAGTGAG AAAGATCCAG GGCGATGGCG CAGCGCTCCA GGAGAAGCTG 
300 

TGTGCCACCT ACAAGCTGTG CCACCCCGAG GAGCTGGTGC TGCTCGGACA CTCTCTGGGC 
360 

ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG 
420 

AGCCAACTCC ATAGCGGCCT TTTCCTCTAC CAGGGGCTCC TGCAGGCCCT GGAAGGGATA 
480 

TCCCCCGAGT TGGGTCCCAC CTTGGACACA CTGCAGCTGG ACGTCGCCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

ATGGCTATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC CTGCCCTGCA GCCCACCCAG 
60 

GGTGCCATGC CGGCCTTCGC CTCTGCTTTC CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT 
120 

AGCCATCTGC AGAGCTTCCT GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC 
160 

ACACCATTGG GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 
240 
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GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC CACCTACAAG 
300 

CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC TGGGCATCCC CTGGGCTCCC 
360 

CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG CTGGCAGGCT GCTTGAGCCA ACTCCATAGC 
420 

GGCCTTTTCC TCTACCAGGG GCTCCTGCAG GCCCTGGAAG GGATATCCCC CGAGTTGGGT 
480 

CCCACCTTGG ACACACTGCA GCTGGACGTC GCCGACTTTG CCACCACCTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

ATGGCTCAGC AGATGGAAGA ACTGGGAATG GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC 
60 

ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT 
120 

CTGCAGAGCT TCCTGGAGGT GTCGTACCGC GTTCTACGCC ACCTTGCGCA GCCCACACCA 
180 

TTGGGCCCTG CCAGCTCCCT GCCCCAGAGC TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA 
240 

AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT GTGCCACCTA CAAGCTGTGC 
300 

CACCCCGAGG AGCTGGTGCT GCTCGGACAC TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC 
360 

TCCTGCCCCA GCCAGGCCCT GCAGCTGGCA GGCTGCTTGA GCCAACTCCA TAGCGGCCTT 
420 
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TTCCTCTACC AGGGGCTCCT GCAGGCCCTG GAAGGGATAT CCCCCGAGTT GGGTCCCACC 
480 

TTGGACACAC TGCAGCTGGA CGTCGCCGAC TTTGCCACCA CCATCTGGTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic)" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATGGCTTTGT TAGGACATTC TTTAGGTATT CCATGGGCTC CTCTGAGCTC CTGCCCCAGC 
60 

CAGGCCCTGC AGCTGGCAGG CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG 
120 

GGGCTCCTGC AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT GGACACACTG 
180 

CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC AGATGGAAGA ACTGGGAATG 
240 

GCCCCTGCCC TGCAGCCCAC CCAGGGTGCC ATGCCGGCCT TCGCCTCTGC TTTCCAGCGC 
300 

CGGGCAGGAG GGGTCCTGGT TGCTAGCCAT CTGCAGAGCT TCCTGGAGGT GTCGTACCGC 
360 

GTTCTACGCC ACCTTGCGCA GCCCACACCA TTGGGCCCTG CCAGCTCCCT GCCCCAGAGC 
420 

TTCCTGCTCA AGTCTTTAGA GCAAGTGAGA AAGATCCAGG GCGATGGCGC AGCGCTCCAG 
480 

GAGAAGCTGT GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGTA ATAA 
534 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174. amino acids 

(B) TYPE: amin acid 
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(C> STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys 
15 10 15 

Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
20 25 30 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
35 40 45 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
50 55 60 

Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
65 70 75 80 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
85 90 95 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
100 105 110 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro 
115 120 125 

Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 
130 135 140 

Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
145 150 155 160 

Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
165 170 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu 
15 10 15 

Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu 
20 25 30 

Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu 
35 40 45 

Gly Met Ala Pro Ala Leu Gin, Pro Thr Gin Gly Ala Met Pro Ala Phe 
50 55 60 

Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
65 70 75 80 

Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala 
85 90 95 

Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu 
100 105 110 

Leu Lya Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala 
115 120 125 

Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu 
130 135 140 

Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser 
145 150 155 160 

Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro 
1 5 10 15 



WO 97/12977 PCT/US96/15935 

158 



Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 
20 25 30 

Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
35 40 45 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
50 55 60 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 
65 70 75 80 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu 
85 90 95 

Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu 
100 105 HO 

Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu 
115 120 125 



Cys Ala Thr Tyr Lys Leu Cys His 
130 135 

His Ser Leu Gly lie Pro Trp Ala 
145 150 

Ala Leu Gin Leu Ala Gly Cys Leu 
165 

(2) INFORMATION FOR SEQ ID NO: 98: 



Pro Glu Glu Leu Val Leu Leu Gly 
140 

Pro Leu Ser Ser Cys Pro Ser Gin 
155 160 

Ser Gin Leu His Ser Gly 
170 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu 
15 10 15 

Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr 
20 25 30 

Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin 
35 40 45 
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Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 
50 55 60 

Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val 
65 70 75 80 

Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro 
85 90 95 

Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
100 105 110 

Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
115 120 125 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
130 135 140 

Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 
145 150 155 ' 160 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 
165 170 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu 
1 5 10 15 

Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu 
20 25 30 

Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro 
35 40 45 

Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala 
50 55 60 

Ser His Leu Gin Ser Phe Leu Glu Val S r Tyr Arg Val Leu Arg His 
65 70 75 80 
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Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser 
85 90 95 

Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly 
100 105 no 

Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro 
115 120 125 

Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro 
130 135 * 140 

Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser 
145 150 155 * 160 

Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
165 170 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala 
15 10 15 

Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 
20 25 30 

Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser 
35 40 45 

Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr 
50 55 60 

Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 

65 70 75 80 

Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 
85 90 95 



Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
100 105 110 
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Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 
115 120 125 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
130 135 140 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro 
145 150 155 160 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
165 170 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro 
1 5 10 15 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala 
20 25 30 

Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser 
35 40 45 

Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala 
50 55 60 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
65 70 75 80 

Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
85 90 95 



Tyr Lys Leu Cys His Pro Glu Glu 
100 

Gly He Pro Trp Ala Pro Leu Ser 

115 120 

Leu Ala Gly Cys Leu Ser Gin Leu 

130 135 



Leu Val Leu Leu Gly His Ser Leu 
105 HO 

Ser Cys Pr Ser Gin Ala Leu Gin 
125 

His Ser Gly Leu Phe Leu Tyr Gin 
140 
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Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr 
145 150 155 160 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr 
165 170 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
1 5 10 15 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
20 25 30 

Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
35 40 45 

Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser 
50 55 60 

Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He 
65 70 75 80 

Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys 
85 90 95 

Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He 
100 105 110 

Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala 
115 120 125 

Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu 
130 135 140 

Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp 
145 150 155 160 



Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
165 170 
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(2) INFORMATION FOR SEQ ID NO : 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
15 10 15 

Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
20 25 30 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
35 40 45 

Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
50 55 60 

Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
65 70 75 80 

Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
85 90 95 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
100 105 110 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro 
115 120 125 

Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 
130 135 140 

Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
145 150 155 160 

Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
165 170 
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WHAT IS CLAIMED IS: 



1. A human G-CSF receptor agonist polypeptide, 
comprising a modified G-CSF amino acid sequence of the 
Formula : 

1 10 

Xaa Xaa Xaa Gly Pro Ala Ser Ser Leu Pro Gin Ser Xaa 

20 

Leu Leu Xaa Xaa Xaa Glu Gin Val Xaa Lys Xaa Gin Gly Xaa Gly 



30 40 

15 Ala Xaa Leu Gin Glu Xaa Leu Xaa Ala Thr Tyr Lys Leu Xaa Xaa 

50 

Xaa Glu Xaa Xaa Val Xaa Xaa Gly His Ser Xaa Gly He Pro Trp 

20 60 70 

Ala Pro Leu Ser Ser Xaa Pro Ser Xaa Ala Leu Xaa Leu Ala Gly 



80 

Xaa Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu 

90 100 

Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu 



110 

30 Xaa Thr Leu Gin Xaa Asp Val Ala Asp Phe Ala Xaa Thr He Trp 

120 130 
Gin Gin Met Glu Xaa Xaa Gly Met Ala Pro Ala Leu Gin Pro Thr 

35 140 

Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Xaa Gin Xaa Xaa Ala 



150 160 
Gly Gly Val Leu Val Ala Ser Xaa Leu Gin Xaa Phe Leu Xaa Xaa 

170 

Ser Tyr Arg Val Leu Xaa Xaa Leu Ala Gin Pro (SEQ ID NO:l) 
wherein 



Xaa at position 1 is Thr, Ser, Arg, Tyr or Gly; 
Xaa at position 2 is Pro or Leu; 
Xaa at position 3 is Leu, Arg, Tyr or Ser; 
xaa at position 13 is Phe, Ser, His, Thr or Pro; 
50 Xaa at position 16 is Lys, Pro, Ser, Thr or His; 
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Xaa at position 17 is Cys, Ser, Gly , Ala, lie, Tyr or Arg; 

Xaa at position 18 is Leu, Thr, Pro, His, lie or Cys; 

Xaa at position 22 is Arg, Tyr, Ser, Thr or Ala; 

Xaa at position 24 is lie, Pro, Tyr or Leu; 
5 Xaa at position 27 is Asp, or Gly; 

Xaa at position 30 is Ala, lie. Leu or Gly; 

Xaa at position 34 is Lys or Ser; 

Xaa at position 36 is Cys or Ser; 

Xaa at position 42 is Cys or Ser; 
10 Xaa at position 43 is His, Thr # Gly, Val, Lys, Trp, Ala, 
Arg, Cys, or Leu; 

Xaa at position 44 is Pro, Gly, Arg, Asp, Val, Ala, His, 
Trp, vJin, or Thr; 

Xaa at position 46 is Glu, Arg, Phe, Arg, lie or Ala; 
15 Xaa at position 47 is Leu or Thr; 

Xaa at position 49 is Leu, Phe, Arg or Ser; 

Xaa at position 50 is Leu, lie, His, Pro or Tyr; 

Xaa at position 54 is Leu or His; 

Xaa at position 64 is Cys or Ser; 
20 Xaa at position 67 is Gin, Lys, Leu or Cys; 

Xaa at position 70 is Gin, Pro, Leu, Arg or Ser; 

Xaa at position 74 is Cys or Ser; 





Xaa 


at 


position 


104 


is 


Asp, 


Gly or Val; 






Xaa 


at 


position 


108 


is 


Leu, 


Ala, Val, Arg, Trp, Gin or 


Gly; 


25 


Xaa 


at 


position 


115 


is 


Thr, 


His, Leu or Ala; 






Xaa 


at 


position 


120 


is 


Gin, 


Gly, Arg, Lys or His 






Xaa 


at 


position 


123 


is 


Glu, 


Arg, Phe or Thr 






Xaa 


at 


position 


144 


is 


Phe, 


His, Arg, Pro, Leu, Gin or 


Glu; 




Xaa 


at 


position 


146 


is 


Arg 


or Gin; 




30 


Xaa 


at 


position 


147 


is 


Arg 


or Gin; 






Xaa 


at 


position 


156 


is 


His, 


Gly or Ser; 






Xaa 


at 


position 


159 


is 


Ser, 


Arg, Thr, Tyr, Val or Gly; 






Xaa 


at 


position 


162 


is 


Glu, 


Leu, Gly or Trp; 






Xaa 


at 


position 


163 


is 


Val, 


Gly, Arg or Ala; 




35 


Xaa 


at 


position 


169 


is 


Arg, 


Ser, Leu, Arg or Cys; 






Xaa 


at 


position 


170 


is 


His, 


Arg or Ser; 





wherein optionally 1-11 amino acids from the N- terminus and 
40 1-5 from the C- terminus can be deleted; 



45 



wherein the N-terminus is joined to the C-terminus directly 
or through a linker capable of joining the N-terminus to the 
C-terminus and having new C- and N-terminus at amino acids; 

38- 39 62-63 123-124 

39- 40 63-64 124-125 

40- 41 64-65 125-126 

41- 42 65-66 126-127 
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42-43 


66-67 


128-129 




43-44 


67-68 


128-129 




45-46 


68-69 


129-130 




48-49 


69-70 


130-131 


5 


49-50 


70-71 


131-132 




52-53 


71-72 


132-133 




53-54 


91-92 


133-134 




54-55 


92-93 


134-135 




55-56 


93-94 


135-136 


10 


56-57 


94-95 


136-137 




57-58 


95-96 


137-138 




58-59 


96-97 


138-139 




59-60 


97-98 


139-140 




60-61 


98-99 


140-141 


15 


61-62 


99-100 


141-142 



or 142-143; and 



said G-CSF receptor agonist polypeptide may optionally be 
20 immediately preceded by (methionine" 1 ) , (alanine" 1 ) or 

(methionine- ' 2 , alanine" 1 ) . 



2. The G-CSF receptor agonist polypeptide, as 
recited in claim 1, wherein said linker is selected from the 
25 group consisting of; 

GlyGlyGlySer (SEQ ID N0:2); 

GlyGlyGlySerGlyGlyGlySer (SEQ ID NO: 61); 

GlyGlyGlySerGlyGlyGlySerGlyGlyGlySer (SEQ IDNO:62); 

SerGlyGlySerGlyGlySer (SEQ ID NO:63); 
30 GluPheGlyAsnMet (SEQ ID NO: 64); 

GluPheGlyGlyAsnMet (SEQ ID NO: 65); 

GluPheGlyGlyAsnGlyGlyAsnMet (SEQ ID NO: 66); and 

GlyGlySerAspMetAlaGly (SEQ ID NO: 67). 

35 3. The G-CSF receptor agonist polypeptide of claim 

1, selected from the group consisting of; 



Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

40 Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 

Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 

Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu 

Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 



i 
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Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 
Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 
Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin 
Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu 
5 Leu Lys Ser Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala 
Ala Leu Gin Glu Lys Leu Cys Ala Thr (SEQ ID NO:48); 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
10 Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala 
Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser 
15 Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Asp 
Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 
His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro 
Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala 
Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly 
20 Leu Leu Gin Ala Leu Glu Gly He Ser (SEQ ID NO:49); 



Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe 
Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser 

25 His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His 
Leu Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin 
Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly 
Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu 
Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly He 

30 Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu 
Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro 
Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr 
He Trp Gin Gin Met Glu Glu Leu Gly (SEQ ID NO: 50); 

35 

Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg 
Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 
Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro Leu 

40 Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu 
Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser 
Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 

45 His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu 
Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu 
Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu 
Leu Gly Met Ala Pro Ala Leu Gin Pro (SEQ ID NO: 51); and 

50 

Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu 
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Ala Gin Pro Thr Pro Leu Gly Pro Ala Ser Ser 

Phe Leu Leu Lys Ser Leu Glu Gin Val Arg Lys 

Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 

His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala 

Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe 

Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

Thr Gin Gly Ala Met Pro Ala Phe Ala (SEQ ID 



Leu Pro 
He Gin 
Tyr Lys 
Leu Gly 
Leu Gin 
Leu Tyr 
Leu Gly 
Ala Thr 
Ala Leu 
NO: 52) . 



Gin Ser 
Gly Asp 
Leu Cys 
He Pro 
Leu Ala 
Gin Gly 
Pro Thr 
Thr He 
Gin Pro 



15 



4. A nucleic acid molecule comprising a DNA sequence 
encoding the G-CSF receptor agonist polypeptide of claim 1. 

5. a nucleic acid molecule comprising a DNA sequence 
encoding the G-CSF receptor agonist polypeptide of claim 2. 



20 



6. A nucleic acid molecule comprising a DNA sequence 
encoding the G-CSF receptor agonist polypeptide of claim 3. 



7. A nucleic acid molecule of claim 6 selected from 
group consisting of; 



25 



30 



35 



1 ATGGCTTACA 

51 TCTGGGCATC 

101 AGCTGGCAGG 

151 GGGCTCCTGC 

201 GGACACACTG 

251 AGATGGAAGA 

301 ATGCCGGCCT 

351 TGCTAGCCAT 

401 ACCTTJCGCA 

451 TTCCTGCTCA 

501 AGCGCTCCAG 



AGCTGTGCCA 
CCCTGGGCTC 
CTGCTTGAGC 
AGGCCCTGGA 
CAGCTGGACG 
ACTGGGAATG 
TCGCCTCTGC 
CTGCAGAGCT 
GCCCACACCA 
AGTCTTTAGA 
GAGAAGCTGT 



CCCCGAGGAG 
CCCTGAGCTC 
CAACTCCATA 
AGGGATATCC 
TCGCCGACTT 
GCCCCTGCCC 
TTTCCAGCGC 
TCCTGGAGGT 
TTGGGCCCTG 
GCAAGTGAGA 
GTGCCACCTA 



CTGGTGCTGC 
CTGCCCCAGC 
GCGGCCTTTT 
CCCGAGTTGG 
TGCCACCACC 
TGCAGCCCAC 
CGGGCAGGAG 
GTCGTACCGC 
CCAGCTCCCT 
AAGATCCAGG 
ATAA (SEQ 



TCGGACACTC 
CAGGCCCTGC 
CCTCTACCAG 
GTCCCACCTT 
ATCTGGCAGC 
CCAGGGTGCC 
GGGTCCTGGT 
GTTCTACGCC 
GCCCCAGAGC 
GCGATGGCGC 
ID NO: 30) ; 



1 ATGGCTCCCG AGTTGGGTCC CACCTTGGAC ACACTGCAGC TGGACGTCGC 

40 51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

201 GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC ACACCATTGG 

251 GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 

45 301 GTGAGAAAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC 
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■ 351 CACCTACAAG CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC 

401 TGGGCATCCC CTGGGCTCCC CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG 

451 CTGGCAGGCT GCTTGAGCCA ACTCCATAGC GGCCTTTTCC TCTACCAGGG 

501 GCTCCTGCAG GCCCTGGAAG GGATATCCTA ATAA (SEQ ID NO: 31); 

5 

1 ATGGCTATGG CCCCTGCCCT GCAGCCCACC CAGGGTGCCA TGCCGGCCTT 

51 CGCCTCTGCT TTCCAGCGCC GGGCAGGAGG GGTCCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

10 151 CCCACACCAT TGGGCCCTGC CAGCTCCCTG CCCCAGAGCT TCCTGCTCAA 

201 GTCTTTAGAG CAAGTGAGAA AGATCCAGGG CGATGGCGCA GCGCTCCAGG 

251 AGAAGCTGTG TGCCACCTAC AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG 

301 CTCGGACACT CTCTGGGCAT CCCCTGGGCT CCCCTGAGCT CCTGCCCCAG 

351 CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG CCAACTCCAT AGCGGCCTTT 

15 401 TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 

451 GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC 

501 CATCTGGCAG CAGATGGAAG AACTGGGATA ATAA (SEQ ID NO: 32); 



20 1 ATGGCTACCC AGGGTGCCAT GCCGGCCTTC GCCTCTGCTT TCCAGCGCCG 

51 GGCAGGAGGG GTCCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT 

101 CGTACCGCGT TCTACGCCAC CTTGCGCAGC CCACACCATT GGGCCCTGCC 

151 AGCTCCCTGC CCCAGAGCTT CCTGCTCAAG TCTTTAGAGC AAGTGAGAAA 

201 GATCCAGGGC GATGGCGCAG CGCTCCAGGA GAAGCTGTGT GCCACCTACA 

25 251 AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 

301 CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGC AGCTGGCAGG 

351 CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC 

401 AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT GGACACACTG 

451 CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC AGATGGAAGA 

30 501 ACTGGGAATG GCCCCTGCCC TGCAGCCCTA ATAA (SEQ ID NO: 33); 

1 ATGGCTTCTG CTTTCCAGCG CCGGGCAGGA GGGGTCCTGG TTGCTAGCCA 

51 TCTGCAGAGC TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC 

35 101 AGCCCACACC ATTGGGCCCT GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC 

151 AAGTCTTTAG AGCAAGTGAG AAAGATCCAG GGCGATGGCG CAGCGCTCCA 

201 GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG GAGCTGGTGC 

251 TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 

301 AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT 

40 351 TTTCCTCTAC CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT 

401 TGGGTCCCAC CTTGGACACA CTGCAGCTGG ACGTCGCCGA CTTTGCCACC 

451 ACCATCTGGC AGCAGATGGA AGAACTGGGA ATGGCCCCTG CCCTGCAGCC 

501 CACCCAGGGT GCCATGCCGG CCTTCGCCTA ATAA (SEQ ID NO: 34); 

45 

1 ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC 

51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

50 201 GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC ACACCATTGG 

251 GCCCTGCCAG CTCCCTGCCC CAGAGCTTCC TGCTCAAGTC TTTAGAGCAA 

301 GTGAG/AAGA TCCAGGGCGA TGGCGCAGCG CTCCAGGAGA AGCTGTGTGC 
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351 CACCTACAAG CTGTGCCACC CCGAGGAGCT GGTGCTGCTC GGACACTCTC 

401 TGGGCATCCC CTGGGCTCCC CTGAGCTCCT GCCCCAGCCA GGCCCTGCAG 

451 CTGGCAGGCT GCTTGAGCCA ACTCCATAGC GGCCTTTTCC TCTACCAGGG 

501 GCTCCTGCAG GCCCTGGAAG GGATATCCTA A (SEQ ID NO: 35); 



1 ATGGCTATGG CTCCAGCTCT GCAACCAACT CAAGGTGCAA TGCCAGCATT 

51 TGCATCTGCT TTTCAACGTC GTGCAGGTGG TGTTCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

10 151 CCCACACCAT TGGGCCCTGC CAGCTCCCTG CCCCAGAGCT TCCTGCTCAA 

201 GTCTTTAGAG CAAGTGAGAA AGATCCAGGG CGATGGCGCA GCGCTCCAGG 

251 AGAAGCTGTG TGCCACCTAC AAGCTGTGCC ACCCCGAGGA GCTGGTGCTG 

301 CTCGGACACT CTCTGGGCAT CCCCTGGGCT CCCCTGAGCT CCTGCCCCAG 

351 CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG CCAACTCCAT AGCGGCCTTT 

15 401 TCCTCTACCA GGGGCTCCTG CAGGCCCTGG AAGGGATATC CCCCGAGTTG 

451 GGTCCCACCT TGGACACACT GCAGCTGGAC GTCGCCGACT TTGCCACCAC 

501 CATCTHGCAG CAGATGGAAG AACTGGGATA A (SEQ ID NO: 36); 



20 1 ATGGCTACTC AAGGTGCTAT GCCAGCTTTT GCTTCTGCTT TTCAACGTCG 

51 TGCAGGTGGT GTTCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT 

101 CGTACCGCGT TCTACGCCAC CTTGCGCAGC CCACACCATT GGGCCCTGCC 

151 AGCTCCCTGC CCCAGAGCTT CCTGCTCAAG TCTTTAGAGC AAGTGAGAAA 

201 GATCCAGGGC GATGGCGCAG CGCTCCAGGA GAAGCTGTGT GCCACCTACA 

25 251 AGCTGTGCCA CCCCGAGGAG CTGGTGCTGC TCGGACACTC TCTGGGCATC 

301 CCCTGGGCTC CCCTGAGCTC CTGCCCCAGC CAGGCCCTGG AGCTGGCAGG 

351 CTGCTTGAGC CAACTCCATA GCGGCCTTTT CCTCTACCAG GGGCTCCTGC 

401 AGGCCCTGGA AGGGATATCC CCCGAGTTGG GTCCCACCTT GGACACACTG 

451 CAGCTGGACG TCGCCGACTT TGCCACCACC ATCTGGCAGC AGATGGAAGA 

30 501 ACTGGGAATG GCCCCTGCCC TGCAGCCCTA A (SEQ ID NO: 37); 

1 ATGGCTTCTG CTTTTCAACG TCGTGCAGGT GGTGTTCTGG TTGCTAGCCA 

51 TCTGCAGAGC TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC 

35 101 AGCCCACACC ATTGGGCCCT GCCAGCTCCC TGCCCCAGAG CTTCCTGCTC 

151 AAGTCTTTAG AGCAAGTGAG AAAGATCCAG GGCGATGGCG CAGCGCTCCA 

201 GGAGAAGCTG TGTGCCACCT ACAAGCTGTG CCACCCCGAG GAGCTGGTGC 

251 TGCTCGGACA CTCTCTGGGC ATCCCCTGGG CTCCCCTGAG CTCCTGCCCC 

301 AGCCAGGCCC TGCAGCTGGC AGGCTGCTTG AGCCAACTCC ATAGCGGCCT 

40 351 TTTCCTCTAC CAGGGGCTCC TGCAGGCCCT GGAAGGGATA TCCCCCGAGT 

401 TGGGTCCCAC CTTGGACACA CTGCAGCTGG ACGTCGCCGA CTTTGCCACC 

451 ACCATCTGGC AGCAGATGGA AGAACTGGGA ATGGCCCCTG CCCTGCAGCC 

501 CACCCAGGGT GCCATGCCGG CCTTCGCCTA A (SEQ ID NO: 38); 

45 

1 ATGGCTCCGG AACTGGGTCC AACTCTGGAC ACACTGCAGC TGGACGTCGC 

51 CGACTTTGCC ACCACCATCT GGCAGCAGAT GGAAGAACTG GGAATGGCCC 

101 CTGCCCTGCA GCCCACCCAG GGTGCCATGC CGGCCTTCGC CTCTGCTTTC 

151 CAGCGCCGGG CAGGAGGGGT CCTGGTTGCT AGCCATCTGC AGAGCTTCCT 

50 201 GGAGGTGTCG TACCGCGTTC TACGCCACCT TGCGCAGCCC TCTGGCGGCT 

251 CTGGCGGCTC TCAGAGCTTC CTGCTCAAGT CTTTAGAGCA AGTGAGAAAG 

301 ATCCAGGGCG ATGGCGCAGC GCTCCAGGAG AAGCTGTGTG CCACCTACAA 
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351 GCTGTGCCAC CCCGAGGAGC TGGTGCTGCT CGGACACTCT CTGGGCATCC 

401 CCTGGGCTCC CCTGAGCTCC TGCCCCAGCC AGGCCCTGCA GCTGGCAGGC 

451 TGCTTGAGCC AACTCCATAG CGGCCTTTTC CTCTACCAGG GGCTCCTGCA 

501 GGCCCTGGAA GGGATATCCT AA (SEQ ID NO: 39); 



1 ATGGCTATGG CTCCAGCTCT GCAACCAACT CAAGGTGCAA TGCCAGCATT 

51 TGCATCTGCT TTTCAACGTC GTGCAGGTGG TGTTCTGGTT GCTAGCCATC 

101 TGCAGAGCTT CCTGGAGGTG TCGTACCGCG TTCTACGCCA CCTTGCGCAG 

151 CCCTCTGGCG GCTCTGGCGG CTCTCAGAGC TTCCTGCTCA AGTCTTTAGA 

201 GCAAGTGAGA AAGATCCAGG GCGATGGCGC AGCGCTCCAG GAGAAGCTGT 

251 GTGCCACCTA CAAGCTGTGC CACCCCGAGG AGCTGGTGCT GCTCGGACAC 

301 TCTCTGGGCA TCCCCTGGGC TCCCCTGAGC TCCTGCCCCA GCCAGGCCCT 

351 GCAGCTGGCA GGCTGCTTGA GCCAACTCCA TAGCGGCCTT TTCCTCTACC 

401 AGGGGCTCCT GCAGGCCCTG GAAGGGATAT CCCCCGAGTT GGGTCCCACC 

451 TTGGACACAC TGCAGCTGGA CGTCGCCGAC TTTGCCACCA CCATCTGGCA 

501 GCAGATGGAA GAACTGGGAT AA (SEQ ID NO: 40); 



1 ATGGCTACTC AAGGTGCTAT GCCAGCTTTT GCTTCTGCTT TTCAACGTCG 

51 TGCAGGTGGT GTTCTGGTTG CTAGCCATCT GCAGAGCTTC CTGGAGGTGT 

101 CGTACCGCGT TCTACGCCAC CTTGCGCAGC CCTC'GGCGG CTCTGGCGGC 

151 TCTCAGAGCT TCCTGCTCAA GTCTTTAGAG CAAGTGAGAA AGATCCAGGG 

201 CGATGGCGCA GCGCTCCAGG AGAAGCTGTG TGCCACCTAC AAGCTGTGCC 

251 ACCCCGAGGA GCTGGTGCTG CTCGGACACT CTCTGGGCAT CCCCTGGGCT 

301 CCCCTTAGCT CCTGCCCCAG CCAGGCCCTG CAGCTGGCAG GCTGCTTGAG 

351 CCAACTCCAT AGCGGCCTTT TCCTCTACCA GGGGCTCCTG CAGGCCCTGG 

401 AAGGGATATC CCCCGAGTTG GGTCCCACCT TGGACACACT GCAGCTGGAC 

451 GTCGCCGACT TTGCCACCAC CATCTGGCAG CAGATGGAAG AACTGGGAAT 

501 GGCCCCTGCC CTGCAGCCCT AA (SEQ ID NO: 41); and 



1 ATGGCTTCTG CTTTTCAACG TCGTGCAGGT GGTGTTCTGG TTGCTAGCCA 

51 TCTGCAGAGC TTCCTGGAGG TGTCGTACCG CGTTCTACGC CACCTTGCGC 

101 AGCCCTCTGG CGGCTCTGGC GGCTCTCAGA GCTTCCTGCT CAAGTCTTTA 

151 GAGCAAGTGA GAAAGATCCA GGGCGATGGC GCAGCGCTCC AGGAGAAGCT 

201 GTGTGCCACC TACAAGCTGT GCCACCCCGA GGAGCTGGTG CTGCTCGGAC 

251 ACTCTCTGGG CATCCCCTGG GCTCCCCTGA GCTCCTGCCC CAGCCAGGCC 

301 CTGCAGCTGG CAGGCTGCTT GAGCCAACTC CATAGCGGCC TTTTCCTCTA 

351 CCAGGGGCTC CTGCAGGCCC TGGAAGGGAT ATCCCCCGAG TTGGGTCCCA 

401 CCTTGGACAC ACTGCAGCTG GACGTCGCCG ACTTTGCCAC CACCATCTGG 

451 CAGCAGATGG AAGAACTGGG AATGGCCCCT GCCCTGCAGC CCACCCAGGG 

501 TGCCATGCCG GCCTTCGCCT AA (SEQ ID NO: 42). 



8. A method of producing a G-CSF receptor agonist 
polypeptide comprising: growing under suitable nutrient 
conditions, a hose cell transformed or transfected with a 
replicable vector comprising said nucleic acid molecule of 
claim 4, 5, 6 or 7 in a manner allowing expression of said 
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G-CSF receptor agonist polypeptide and recovering said G-CSF 
receptor agonist polypeptide. 



9. A composition comprising; a G-CSF receptor 
5 agonist polypeptide according to claim 1, 2, or 3; and a 
pharmaceutically acceptable carrier. 

10* A composition comprising; a G-CSF receptor 
agonist polypeptide according to claim 1, 2, or 3; a colony 
10 stimulating factor; and a pharmaceutically acceptable 
carrier. 



11. A composition comprising; a G-CSF receptor 
agonist polypeptide according to claim 1, 2, or 3; a colony 
15 stimulating factor selected from the group consisting of: 

GM-CSF, c-mpl ligand, M-CSF, erythropoietin, IL-1, IL-4, IL- 
2, IL-3, IL-5, IL 6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, 
IL-13, IL-15, LIF, flt3/flk2 ligand, human growth hormone, 
20 B-cell growth factor, B-cell differentiation factor, 

eosinophil differentiation factor and stem cell factor; and 



a pharmaceutically acceptable carrier. 



25 12. A method of stimulating the production of 

hematopoietic cells in a patient comprising the step of; 
administering said G-CSF receptor agonist polypeptide of 
claim 1, 2, or 3 to said patent. 

30 13. A method of stimulating the production of 

hematopoietic cells in a patient comprising the step of 
administering said composition of claim 9, 10 or 11 to said 
patient. 



+ 

WO 97/12977 



PCT/US96/15935 



173 



14. A method for selective ex vivo expansion of stem cells 
comprising the steps of; (a) separating stem cells from other 
cells; (b) culturing said separated stem cells with a selected 
culture medium comprising the polypeptide of claim 1, 2, or 3; 
and 

(c) harvesting said cultured cells. 

15. A method for selective ex vivo expansion of stem cells 
comprising the steps of; (a) separating stem cells from other 
cells; (b) culturing said separated stem cells with a selected 
culture medium comprising the composition of claim 9, 10 or 11; 
and 

(c) harvesting said cultured cells. 

16. A method for treatment of a patient having a 
hematopoietic disorder, comprising the steps of; (a) 
removing stem cells; (b) separating stem cells from other 
cells; (c) culturing said separated stem cells with a 
selected culture medium comprising the polypeptide of claim 
1, 2, or 3; 

(d) harvesting said cultured cells; and 

(e) transplanting said cultured cells into said 
patient. 

17. A method for treatment of a patient having a 
hematopoietic disorder, comprising the steps of; (a) 
removing stem cells; (b) separating stem cells from other 
cells; (c) culturing said separated stem cells with a 
selected culture medium comprising the composition of claim 
9; 

(d) harvesting said cultured cells; and 

(e) transplanting said cultured cells into said 
patient. 



WO 97/12977 



PCT/US96/15935 



174 



10 



18. A method for treatment of a patient having a 
hematopoietic disorder, comprising the steps of; (a) 
removing stem cells; (b) separating stem cells from other 
cells; (c) culturing said separated stem cells with a 
selected culture medium comprising the composition of claim 
10; 

(d) harvesting said cultured cells; and 

(e) transplanting said cultured cells into said 
patient* 



19. A method for treatment of a patient having a 
hematopoietic disorder, comprising the steps of; (a) 
removing stem cells; (b) separating stem cells from other 
cells; (c) culturing said separated stem cells with a 

15 selected culture medium comprising the composition of claim 
11; 

(d) harvesting said cultured cells; and 

(e) transplanting said cultured cells into said 
patient. 

20 

20. A method of human gene therapy, comprising the 
steps of; 

(a) removing stem cells from a patient; 

(b) separating said stem cells from other cells; 
25 (c) culturing said separated stem cells with a 

selected culture medium comprising the hematopoietic protein 
of claim 1, 2, or 3 ; 

(d) introducing DNA into said cultured cells; 

(e) harvesting said transduced cells; and 

30 (f) transplanting said transduced cells into said 

patient. 
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21. A method of human gene therapy, comprising the 
steps of; 

(a) removing stem cells from a patient; 
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(b) separating said stem cells from other cells; 

(c) culturing said separated stem cells with a 
selected 

media comprising the composition of claim 9; 
5 (d) introducing DNA into said cultured cells; 

(e) harvesting said transduced cells; and 

(f) transplanting said transduced cells into said 
patient. 

10 22. A method of human gene therapy, comprising the 

steps of; 

(a) removing stem cells from a patient; 

(b) separating said stem cells from other cells; 

(c) culturing said separated stem cells with a 
15 selected 

media comprising the composition of claim 10; 

(d) introducing DNA into said cultured cells; 

(e) harvesting said transduced cells; and 

(f) transplanting said transduced cells into said 
20 patient. 

23. A method pf human gene therapy, comprising the 
steps of; 

(a) removing stem cells from a patient; 
25 (b) separating said stem cells from other cells; 

(c) culturing said separated stem cells with a 
selected 

media comprising the composition of claim 11; 

(d) introducing DNA into said cultured cells; 
30 (e) harvesting said transduced cells; and 

(f) transplanting said transduced cells into said 
patient. 
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24. A method of claim 14, 15, 16, 17, 18, 19, 20, 21, 
22 or 23 wherein said stem cells are isolated from 
peripheral blood. 
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